AI - Samza

Real-time data processing at scale with Apache Samza, a distributed framework built on Apache Kafka and Apache Hadoop YARN.

About Samza

Samza is a distributed stream processing framework and a top-level project of the Apache Software Foundation. It uses Apache Kafka for messaging and Apache Hadoop YARN for fault tolerance, processor isolation, security, and resource management.

Key features of Samza include:

  • Uses Apache Kafka for messaging
  • Leverages Apache Hadoop YARN for distributed processing
  • Provides real-time data processing at scale
  • Offers high availability and fault tolerance
  • Supports stream-processing constructs such as windowing (including session windows) and triggers
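
To make the session window idea from the list above concrete, here is a minimal sketch in plain Python. It does not use Samza's API; the function name `session_windows`, the `(key, timestamp)` event shape, and the `gap` parameter are all assumptions made for illustration. It also processes a finished batch of events, whereas a streaming engine would emit sessions incrementally as triggers fire.

```python
from typing import Dict, Iterable, List, Tuple

Event = Tuple[str, float]  # (key, timestamp in seconds); hypothetical event shape
Session = Tuple[str, float, float, int]  # (key, start, end, event_count)


def session_windows(events: Iterable[Event], gap: float) -> List[Session]:
    """Group events per key into sessions.

    A session for a key closes when more than `gap` seconds pass
    without a new event for that key.
    """
    closed: List[Session] = []
    open_sessions: Dict[str, List[float]] = {}  # key -> [start, last_seen, count]

    # Process events in timestamp order, as an in-order stream would deliver them.
    for key, ts in sorted(events, key=lambda e: e[1]):
        current = open_sessions.get(key)
        if current is not None and ts - current[1] <= gap:
            # Still within the inactivity gap: extend the open session.
            current[1] = ts
            current[2] += 1
        else:
            # Gap exceeded (or first event for this key): close and start anew.
            if current is not None:
                closed.append((key, current[0], current[1], int(current[2])))
            open_sessions[key] = [ts, ts, 1]

    # End of input: flush whatever sessions are still open.
    for key, (start, end, count) in open_sessions.items():
        closed.append((key, start, end, int(count)))

    return sorted(closed, key=lambda s: (s[1], s[0]))


if __name__ == "__main__":
    clicks = [("alice", 0.0), ("alice", 4.0), ("bob", 5.0), ("alice", 30.0)]
    for session in session_windows(clicks, gap=10.0):
        print(session)
```

With a 10-second gap, the two early events for `alice` fall into one session, and her event at 30 seconds starts a new one.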

Samza is built with Gradle and accepts contributions from the developer community under the project's contribution guidelines. It is an open-source Apache project with an active user base.
