AI - Samza

Empowering real-time data processing at scale through Apache Samza's distributed framework utilizing Apache Kafka and Apache Hadoop YARN.

About Samza

Apache Samza is a distributed stream processing framework and a top-level project of the Apache Software Foundation. It uses Apache Kafka for messaging and Apache Hadoop YARN for fault tolerance, processor isolation, security, and resource management.

Key features of Samza include:

  • Uses Apache Kafka for messaging
  • Leverages Apache Hadoop YARN for distributed processing
  • Provides real-time data processing at scale
  • Offers high availability and fault tolerance
  • Supports windowed stream processing, including session windows and triggers
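
To make the session window idea from the list above concrete, here is a minimal sketch in plain Python. It does not use Samza's API; the names Session and sessionize, the (key, timestamp, payload) event shape, and the gap parameter are all assumptions made for illustration. It only shows the general technique: events for the same key are grouped together until a period of inactivity longer than the gap closes the session.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Tuple


@dataclass
class Session:
    """Hypothetical container for one session window (not a Samza type)."""
    key: str
    start: int  # timestamp of the first event in the session
    end: int    # timestamp of the latest event in the session
    events: List[str] = field(default_factory=list)


def sessionize(events: Iterable[Tuple[str, int, str]], gap: int) -> List[Session]:
    """Group (key, timestamp, payload) events into per-key sessions.

    A session for a key is closed when a new event for that key arrives
    more than `gap` time units after the previous one. Events are assumed
    to arrive in timestamp order for each key.
    """
    open_sessions: Dict[str, Session] = {}
    closed: List[Session] = []
    for key, ts, payload in events:
        current = open_sessions.get(key)
        # Inactivity longer than the gap closes the current session.
        if current is not None and ts - current.end > gap:
            closed.append(current)
            current = None
        if current is None:
            current = Session(key=key, start=ts, end=ts)
            open_sessions[key] = current
        current.end = ts
        current.events.append(payload)
    # End of input acts as the final trigger: emit sessions still open.
    closed.extend(open_sessions.values())
    return closed


if __name__ == "__main__":
    sample = [
        ("alice", 0, "login"),
        ("alice", 5, "click"),
        ("bob", 6, "login"),
        ("bob", 8, "logout"),
        ("alice", 30, "click"),
    ]
    for s in sessionize(sample, gap=10):
        print(s.key, s.start, s.end, s.events)
```

In a real streaming system the input never ends, so a trigger (for example a timer firing after the gap elapses) decides when a session's result is emitted. The sketch stands in for that by emitting the remaining open sessions once the input is exhausted.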

Samza is built with Gradle, and the project accepts contributions from the developer community under its contribution guidelines. It is an open-source project of the Apache Software Foundation with an active user base.

More companies

Rivery

Empowering businesses with seamless AI-powered integrations and data orchestration from leading ecosystems.

Owkin

Revolutionizing healthcare with AI for accurate diagnoses, personalized treatments, and improved outcomes. A federated research network ensuring data privacy and collaboration.

Bytewax

Empowering developers to build complex event-driven workflows using AI technologies with Bytewax's open-source Dataflow engine and flexible tools for real-time data processing and transformation.
