Samza is Apache's open-source stream processing framework for real-time processing of events from Kafka or other message systems. Samza jobs read from message topics, process events (filtering, mapping, aggregating, joining), and write to output topics or external systems. Samza excels at low-latency, high-throughput event processing with exactly-once semantics (no duplicates or losses). Key concepts: stateless operations (simple transformations), stateful operations (aggregations, JOINs using local state stores), windowing (time-based grouping), and checkpointing (fault tolerance). Samza is tightly integrated with Kafka—designed for high-volume event streams.