AI - Project Nessie

Empowering efficient and reliable data lake management through Project Nessie's Git-like catalog solution based on Iceberg format.

Logo of Project Nessie
Last Audited At

About Project Nessie

Project Nessie is a Transactional Catalog for Data Lakes, developed with Git-like semantics. It provides a platform for managing and versioning data lake metadata using the Iceberg format. Project Nessie's offerings include compatibility with various versions of Iceberg, Spark, Hive, Flink, and Presto, as well as Trino. Their mission is to make managing large-scale data lakes more efficient and reliable by offering a versioned catalog solution. They use Git-like semantics to enable rollbacks, branching, merging, and other Git operations on metadata, making it easier for users to collaborate and manage their data lake infrastructure. Project Nessie's offerings are built using open-source technologies and are available through various repositories such as Maven Central, PyPI, Quay.io Docker, and Artifact Hub. The platform is compatible with different Iceberg versions and Spark, Hive, Flink, Presto, and Trino variations, ensuring versatility and wide applicability.

Was this page helpful?

More companies

Estuary Flow

Continuous Data Synchronization Across Systems

Read more

Scortex

Automating quality control with AI-powered visual inspection.

Read more

ClickHouse

Discover ClickHouse, the open-source database system for analyzing large datasets in real-time. Known for efficiency and flexibility.

Read more

Tell us about your project

Our Hubs

London, United Kingdom

A global AI hotspot, thrives on innovation, diverse talent, and a dynamic tech ecosystem, offering unparalleled opportunities for AI engineers.

Munich, Germany

A vibrant AI hub, merges cutting-edge technology with rich cultural experiences, creating an inspiring environment for AI engineers.