Parquet

Parquet is an open-source columnar storage technology that provides efficient storage and simpler processing for large datasets. The project emphasizes coding standards, community collaboration, and Apache licensing.

About Parquet

Parquet is an open-source data storage technology developed under the Apache Software Foundation. It stores large datasets in a columnar layout, keeping the values of each column together rather than interleaving them row by row. Its primary goal is to make data analysis faster: a query can read only the columns it needs, which reduces I/O, and values of the same type stored side by side compress well.
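
To illustrate why a columnar layout reduces I/O, here is a minimal sketch in Python using only the standard library. It is not the Parquet file format and does not use any Parquet library; the record fields (`user_id`, `score`, `country`) and all function names are hypothetical, chosen only to compare how many bytes a single-column read has to touch in each layout.

```python
import struct

# Hypothetical records: (user_id, score, country code).
ROWS = [(1, 9.5, b"US"), (2, 7.25, b"DE"), (3, 8.0, b"GB"), (4, 6.5, b"US")]

# Little-endian int64, float64, 2-byte string: 18 bytes per record, no padding.
ROW_FORMAT = "<qd2s"


def encode_row_oriented(rows):
    """Store whole records one after another."""
    return b"".join(struct.pack(ROW_FORMAT, *row) for row in rows)


def encode_column_oriented(rows):
    """Store each column as its own contiguous block."""
    n = len(rows)
    return {
        "user_id": struct.pack(f"<{n}q", *(r[0] for r in rows)),
        "score": struct.pack(f"<{n}d", *(r[1] for r in rows)),
        "country": b"".join(r[2] for r in rows),
    }


def scores_from_rows(blob):
    """Decode every record to pull out one field; returns (scores, bytes scanned)."""
    scores = [record[1] for record in struct.iter_unpack(ROW_FORMAT, blob)]
    return scores, len(blob)


def scores_from_columns(columns):
    """Decode only the score block; returns (scores, bytes scanned)."""
    block = columns["score"]
    scores = [value[0] for value in struct.iter_unpack("<d", block)]
    return scores, len(block)


row_scores, row_bytes = scores_from_rows(encode_row_oriented(ROWS))
col_scores, col_bytes = scores_from_columns(encode_column_oriented(ROWS))
print(row_scores == col_scores, row_bytes, col_bytes)
```

Both layouts yield the same scores, but the row-oriented read scans all 72 bytes while the column-oriented read scans only the 32 bytes of the `score` block. Real columnar formats add encoding, compression, and metadata on top of this basic idea.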

Parquet simplifies data processing on platforms such as Hadoop and Spark. The project maintains strict coding standards to keep the codebase readable and maintainable. Key aspects of the project include:

  1. Pig compatibility: Parquet integrates with Apache Pig, whose Pig Latin language is used to define data flow graphs, so that Pig scripts can read and write Parquet data. It also works with tools such as Hive.
  2. Schema conversion: For certain data storage methods, Parquet converts schemas automatically, so developers do not have to write the corresponding Parquet schema by hand.
  3. Contributor community: The project boasts a dedicated community of authors and contributors that collaborate on enhancing and expanding Parquet's capabilities.
  4. Code of conduct: Parquet adheres to two codes of conduct, the Apache Software Foundation's and Twitter's, which are intended to keep the development environment positive and inclusive.
  5. Licensing: Parquet is licensed under the Apache License, Version 2.0.
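
The schema conversion in item 2 can be pictured with a small sketch. This is a hypothetical illustration in Python, not Parquet's own converter: the type mapping, the `to_message_schema` helper, and the example fields are all assumptions made for this example. It only shows the general idea of deriving a Parquet-style message schema from an existing record description.

```python
# Hypothetical mapping from Python types to Parquet primitive type names.
PRIMITIVES = {int: "int64", float: "double", bool: "boolean", str: "binary"}


def to_message_schema(name, fields):
    """Render a Parquet-style message schema from {field: (python_type, nullable)}."""
    lines = [f"message {name} {{"]
    for field_name, (py_type, nullable) in fields.items():
        repetition = "optional" if nullable else "required"
        primitive = PRIMITIVES[py_type]
        # Strings are stored as binary with a UTF8 annotation.
        annotation = " (UTF8)" if py_type is str else ""
        lines.append(f"  {repetition} {primitive} {field_name}{annotation};")
    lines.append("}")
    return "\n".join(lines)


schema = to_message_schema(
    "user",
    {"id": (int, False), "name": (str, True), "score": (float, True)},
)
print(schema)
```

The point of automatic conversion is that the developer keeps working with the record description they already have, and the Parquet-side schema is derived from it rather than maintained separately.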

To contribute to Parquet, send pull requests against the official Git repository at https://github.com/apache/parquet-mr. The project welcomes contributions in various forms, including code improvements and issue reports.
