AI - Hawq
An open-source data warehousing system with advanced AI technologies for scalable query processing and regulatory compliance, supporting SQL/HiveQL and integrating with Hadoop ecosystem.
- Name
- Hawq - https://github.com/apache/hawq
- Last Audited At
About Hawq
Hawq is an open-source data warehousing system developed under the Apache Software Foundation. It is classified as Export Commodity Control Number (ECCN) 5D002.C.1 by the U.S. Government Department of Commerce, Bureau of Industry and Security, which includes information security software using or performing cryptographic functions with asymmetric algorithms. This makes it eligible for export under the License Exception ENC Technology Software Unrestricted (TSU) exception.
Hawq provides a scalable, distributed query processing system that supports both SQL and HiveQL queries over large datasets. It includes several components such as the test suite (TestHawqRegister, TestTPCH.TestStress, TestHdfsFault, TestZookeeperFault, and TestHawqFault) for registering tests, stress testing, fault testing HDFS, Zookeeper, and Hawq respectively.
The system uses advanced AI technologies to optimize query performance and improve data analytics capabilities. It supports various file formats like ORC, Parquet, and Avro, and integrates with Apache Hadoop's HDFS and Apache ZooKeeper for managing distributed files and coordinating distributed applications respectively.
Hawq adheres to strict export control regulations regarding the use, possession, and re-export of encryption software and follows proper CI (Continuous Integration) processes using Travis CI Build, Apache Release Audit Tool, and Coverity Static Analysis for maintaining high code quality and security standards.