Apache Samza: LinkedIn’s Real-time Stream Processing Framework

  • Samza is a highly scalable framework for distributed stream processing; it delegates message transport to an underlying messaging system
  • Samza runs on Apache Hadoop YARN and uses Apache Kafka for messaging (Kafka is a publish/subscribe system able to handle hundreds of MB of reads and writes per second)
  • LinkedIn uses Samza to publish more than 26 billion unique messages per day to hundreds of message feeds, which are consumed by thousands of automated subscribers (some in real time, others in batch)
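
To make the feed and subscriber model above concrete, here is a minimal in-memory sketch in Python. It is a toy illustration only, not the Samza or Kafka API: the class name ToyFeedLog, its methods, and the feed and subscriber names are all hypothetical. The one idea it borrows from Kafka is that each feed is an append-only log and every subscriber keeps its own read position, which is what lets real-time and batch subscribers share the same feed.

```python
from collections import defaultdict


class ToyFeedLog:
    """In-memory stand-in for a publish/subscribe log (hypothetical, not the Kafka API)."""

    def __init__(self):
        # feed name -> append-only list of messages
        self._feeds = defaultdict(list)
        # (subscriber, feed) -> index of the next unread message
        self._offsets = defaultdict(int)

    def publish(self, feed, message):
        """Append a message to the end of a feed."""
        self._feeds[feed].append(message)

    def poll(self, subscriber, feed, max_messages=None):
        """Return unread messages for this subscriber and advance its offset."""
        log = self._feeds[feed]
        start = self._offsets[(subscriber, feed)]
        end = len(log) if max_messages is None else min(len(log), start + max_messages)
        self._offsets[(subscriber, feed)] = end
        return log[start:end]


if __name__ == "__main__":
    log = ToyFeedLog()

    # A "real-time" subscriber polls right after each publish.
    for event in ["view:1", "view:2", "view:3"]:
        log.publish("page-views", event)
        print("realtime got", log.poll("realtime", "page-views"))

    # A "batch" subscriber reads the same feed later, at its own pace.
    print("batch got", log.poll("batch", "page-views"))
```

Because offsets are tracked per subscriber rather than per feed, a slow batch reader never holds back a real-time one; each simply resumes from where it last stopped.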

Sources: