AI Product Engineer Logo

Command Palette

Search for a command to run...

Back to AI Ecosystem

OpenLineage

Empowering organizations to trace and understand the complete data lineage and provenance journey across systems with OpenLineage's open-source tools and active community.

OpenLineage logo
Open Source Infrastructure

OpenLineage is an open-source project developing tools for data lineage and provenance, providing a platform to help organizations manage and trace data across systems with active development and Apache 2.0 license. Notably, they offer a Java library, integrations, and contribute across various channels, presenting at events like Data Engineering Summit and Berlin Buzzwords.

About OpenLineage

OpenLineage is an open-source project that develops tools and technologies for data lineage and provenance. They provide a community-driven platform designed to help organizations manage and trace the lifecycle of their data across various systems and workflows. The project is actively developed, as indicated by its CircleCI badge, and is licensed under the Apache 2.0 license.

OpenLineage's mission is to enable better understanding of how data moves and evolves throughout the data pipeline. They offer services including a Java library, which can be accessed through Maven Central, and various integrations with different platforms. Their contributions span across multiple channels, such as GitHub, Slack, Twitter, LinkedIn, YouTube, Mastodon, and their mailing list.

Notable achievements for OpenLineage include presentations at events like the Data Engineering Summit, Berlin Buzzwords, and COSS Conference. They have also been recognized on platforms like Coss Community and Databricks Data & AI Summit. The project is committed to fostering a vibrant community by encouraging open collaboration and contribution through their contributing guide and vulnerability reporting process.