AI - Kedro

Open-source DSML framework empowers data scientists/engineers: build, version, deploy reproducible pipelines via catalog-based approach, modularity, parallel exec., integrations.

Logo of Kedro
Last Audited At

About Kedro

Kedro is an open-source Data Science and Machine Learning (DSML) framework developed in Python. The project's mission is to help data scientists and engineers build, version, and deploy reproducible DSML pipelines.

Kedro provides a catalog-based approach for organizing and managing DSML projects, enabling modularity, extensibility, and collaboration among team members. They offer various features such as:

  1. Project organization: Kedro helps manage the project's structure with a clear separation of code, data, and configuration.
  2. Catalog-based approach: A catalog is a collection of nodes that represent tasks, transformations, or actions in your DSML pipeline. This hierarchical design allows for easy composition and reuse of components.
  3. Data Catalog: Kedro includes a built-in data catalog to store metadata about input and output datasets, allowing users to easily access their data within the pipeline.
  4. Modularity and Composability: DSML pipelines can be constructed as modular units that can be reused across multiple projects. This leads to increased code maintainability, testability, and reusability.
  5. Parallel Execution: Kedro allows for parallel execution of tasks within a pipeline, which can significantly reduce the overall runtime of your DSML workflows.
  6. Versioning: Kedro supports version control for pipelines and catalogs using Git or other version control systems. This ensures that teams can work on different branches of their projects simultaneously while maintaining data consistency.
  7. Integrations and Extensibility: Kedro offers integrations with various popular data science libraries and tools such as NumPy, Pandas, TensorFlow, Scikit-learn, and more. It also allows users to easily extend its functionality through custom plugins.
  8. Community and Partnerships: Kedro has an active community on Slack and GitHub, with over 2,300 members as of March 2023. They have received recognition from Core Infrastructure Initiative, PyPI, Anaconda, and PEP 517. These collaborations contribute to the ongoing development, maintenance, and adoption of Kedro within the DSML community.

More companies

Meltano

Meltano streamlines data integration: Extract, transform, load data from multiple sources to destinations. Detailed logs ensure seamless collaboration and data security.

Read more

Involve AI

Transforming sales processes with intuitive AI-powered platform, R2D2, for real-time lead enrichment, personalized outreach, and industry trend analysis.

Read more

cnvrg.io

Empowering businesses with a comprehensive AI solution through industry-leading partnerships and an easy-to-use platform by cnvrg.io.

Read more

Tell us about your project

Our Hubs

London, United Kingdom

A global AI hotspot, thrives on innovation, diverse talent, and a dynamic tech ecosystem, offering unparalleled opportunities for AI engineers.

Munich, Germany

A vibrant AI hub, merges cutting-edge technology with rich cultural experiences, creating an inspiring environment for AI engineers.