What is Apache Storm?

Apache Storm is a real-time, distributed computation system commonly used in Big Data Analytics. It is free and open-source.

Apache Storm is known for its speed. One benchmark clocked it at over a million tuples processed per second per node. Because it processes each tuple as it arrives, it typically offers lower latency than the micro-batch model of Apache Spark Streaming.

Apache Storm is scalable and fault-tolerant, and it guarantees that your data will be processed. It is easy to set up and operate and can be used with any programming language.

Uses

Apache Storm has many use cases:

  • real-time analytics
  • online machine learning
  • continuous computation
  • distributed RPC
  • extract, transform, load (ETL)
  • and more

Topology

Apache Storm reads a raw stream of real-time data at one end through a “spout.” The spout converts the input into a stream of tuples. This stream then passes through a sequence of small processing units called “bolts,” which emit the processed information at the other end.
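
To make the spout-to-bolt flow concrete, here is a minimal sketch in plain Python. It does not use the Apache Storm API; it only mimics the data flow with generators. The names `sentence_spout`, `split_bolt`, `count_bolt`, and `run_topology` are hypothetical and chosen for illustration, and the sketch runs in a single process, whereas a real topology would be distributed across a cluster.

```python
from typing import Iterable, Iterator

# Hypothetical names throughout: this imitates the spout -> bolt data flow
# in plain Python and is NOT the Apache Storm API.


def sentence_spout(raw_lines: Iterable[str]) -> Iterator[tuple]:
    """Spout: convert a raw input stream into a stream of one-field tuples."""
    for line in raw_lines:
        yield (line.strip(),)


def split_bolt(stream: Iterable[tuple]) -> Iterator[tuple]:
    """Bolt: split each sentence tuple into one tuple per lowercase word."""
    for (sentence,) in stream:
        for word in sentence.split():
            yield (word.lower(),)


def count_bolt(stream: Iterable[tuple]) -> Iterator[tuple]:
    """Bolt: emit (word, running_count) every time a word arrives."""
    counts = {}
    for (word,) in stream:
        counts[word] = counts.get(word, 0) + 1
        yield (word, counts[word])


def run_topology(raw_lines: Iterable[str]) -> list:
    """Wire spout -> split bolt -> count bolt and collect the emitted tuples."""
    return list(count_bolt(split_bolt(sentence_spout(raw_lines))))


if __name__ == "__main__":
    for out in run_topology(["storm reads a stream", "a stream of tuples"]):
        print(out)
```

Each stage consumes tuples from the previous one and emits new tuples, which is the same shape as the spout and bolt chain described above. Generators keep the sketch lazy, so every tuple moves through the whole chain as soon as it is read rather than waiting for the full input.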

Apache Storm topology

CONTRIBUTOR

Nouman Abbasi
Copyright ©2024 Educative, Inc. All rights reserved