
Stream Processing

Learn about stream processing and its role in handling unbounded data flows in large-scale systems. Understand how stream processing reduces delays in data analysis compared to batch methods, and explore real-world technologies like Apache Kafka and Apache Flink. Gain insight into processing data instantly or in small batches based on business needs.

In the word counting example for the MapReduce algorithm, we noticed that the entire text is stored somewhere, and the mapper machines load and process chunks of the data. The data is more or less organized in batches: the processing system loads the batches, does the job, and eventually produces some form of output data.

Now, the input is somewhat bounded. We assumed that we had all the text of English literature. All we had to do was run the MapReduce algorithm on top of the data and gather the results.

Now the important question.

What if the data is unbounded?

How to handle unbounded data

In a real-life system where we need data processing, data is almost always unbounded. Let’s quickly discuss an example.

Assume the engineers at Instagram decide to analyze user behavior on Instagram videos. They want to track how users use the app and interact with content. The standard practice here is to trigger events from the app to the backend system based on user actions. Examples of such events could be a CLICK_EVENT, PAUSE_EVENT, or RESUME_EVENT.

Instagram is a widespread product. Users are continuously using Instagram and watching videos, which means the number of events coming to the Instagram backend will be unbounded. How would Instagram engineers process the events in this case?

One way of doing this would be to run batch processing on top of the data at a certain interval, say every day. Every day, the batch processing pipeline is triggered at some point: the event data from the past 24 hours, held in some storage, is processed by the pipeline to generate insights.
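The daily batch job described above can be sketched as follows. This is a minimal illustration, assuming events are plain dictionaries already collected in storage; the function names and event shapes are hypothetical, not Instagram's actual pipeline.

```python
from collections import Counter

# Hypothetical event records accumulated in storage over the past 24 hours.
stored_events = [
    {"user": "alice", "type": "CLICK_EVENT"},
    {"user": "bob", "type": "PAUSE_EVENT"},
    {"user": "alice", "type": "RESUME_EVENT"},
    {"user": "carol", "type": "CLICK_EVENT"},
]

def run_daily_batch(events):
    """Process a full day's worth of events in one pass and
    return a simple insight: counts per event type."""
    return Counter(e["type"] for e in events)

insights = run_daily_batch(stored_events)
print(insights)
```

The key property to notice is that `run_daily_batch` only runs after all the day's events are collected, which is exactly where the delay discussed next comes from.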

Batch processing example for the Instagram use case

Based on the system needs, the batch job could run hourly or daily. But one significant downside is the delay, and for some business use cases, such a delay is not acceptable.

Enter stream processing.

What is stream processing?

A stream is a series of data incrementally flowing through the system.

In the example from the previous section, the continuous flow of video events to the Instagram system is a stream.

Note: The idea of stream processing is to process an event or an item from a stream of data as soon as the event appears in the stream. After processing, the stream processor will potentially publish the processed data into a new stream. Consumers interested in the processed data can now directly consume the new stream and act accordingly.

If you think about it, the idea is very intuitive. We wanted to get rid of the delay in batch processing. So, the obvious thing to do here is to process the data as soon as it is available.
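The contrast with batch processing can be made concrete with a short sketch. Here a Python generator stands in for an unbounded event stream (an assumption for illustration; a real stream never ends), and each event is handled the moment it appears rather than after a day of accumulation.

```python
def event_stream():
    """Stand-in for an unbounded stream; here we just yield a few events.
    In a real system, this source would never be exhausted."""
    for event in ["CLICK_EVENT", "PAUSE_EVENT", "RESUME_EVENT"]:
        yield event

processed = []  # stand-in for the new stream the processor publishes to

def process(event):
    """Handle one event as soon as it arrives, with no batching delay."""
    result = f"processed:{event}"
    processed.append(result)
    return result

for event in event_stream():
    process(event)
```

Each item is processed and republished immediately, which is the essence of the stream processing idea from the note above.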

How soon should we process the data?

An important part of the stream processing system is to figure out how soon we would want to process data. This depends on business use cases and system capabilities. Let’s briefly discuss a few possibilities.

  • Process data instantaneously: In this case, when an event appears in the stream, it is processed immediately without any delay.

  • Process data within seconds: If the requirement is more lenient, we might consider processing each event after a few seconds of delay.

  • Process data in small batches: Sometimes, a small batch of data is processed at a time. For example, we might process data in batches covering 1 or 5 minutes of events.

Generally, the first two options are popularly known as real-time processing. The last one is near-real-time processing.
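The third option, near-real-time micro-batching, can be sketched by grouping timestamped events into fixed windows. This is a minimal illustration, assuming events arrive as `(timestamp_in_seconds, payload)` pairs; the function name and window size are hypothetical.

```python
def micro_batches(events, window_seconds=60):
    """Group timestamped events into fixed windows (e.g. 1-minute batches),
    so each small batch can be processed shortly after its window closes."""
    batches = {}
    for ts, payload in events:
        window = ts // window_seconds  # index of the window this event falls in
        batches.setdefault(window, []).append(payload)
    return batches

events = [(5, "CLICK"), (42, "PAUSE"), (75, "RESUME"), (130, "CLICK")]
print(micro_batches(events))
# {0: ['CLICK', 'PAUSE'], 1: ['RESUME'], 2: ['CLICK']}
```

Shrinking `window_seconds` moves the system toward real-time processing; growing it moves back toward classic batch processing.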

Stream processing mechanism

Let’s briefly discuss how stream processing is done in real life. The idea is pretty similar to what we have seen with batch processing.

Data items are continuously put into a queue that a stream processing framework listens to. One example of such a queue is Apache Kafka, one of the most widely used technologies in the industry. An example of a stream processing framework is Apache Flink.

As data is put on the queue, the framework continuously reads data from the queue and runs some actions. As soon as there is a result, the result is then put into a new stream for other systems to consume.
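The read-process-republish loop described above can be sketched with in-memory queues. These stand in for Kafka topics purely for illustration (an assumption; a real deployment would use a Kafka client and a framework like Flink, whose APIs are not shown here), and a sentinel value ends the otherwise endless loop for the demo.

```python
import queue

# In-memory queues as stand-ins for the input and output streams.
input_stream = queue.Queue()
output_stream = queue.Queue()

for e in ["CLICK_EVENT", "PAUSE_EVENT"]:
    input_stream.put(e)
input_stream.put(None)  # sentinel marking the end of this demo

def run_processor():
    """Continuously read events from the input stream, run some action,
    and publish the result onto a new stream for downstream consumers."""
    while True:
        event = input_stream.get()
        if event is None:  # demo-only: real streams have no end
            break
        output_stream.put(f"enriched:{event}")

run_processor()
```

Downstream systems would then consume `output_stream` and act on the results, exactly as the diagram below shows.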

Stream processing example

Note that an algorithm like MapReduce can also be used in stream processing. It’s just that in the case of stream processing, the framework will probably choose a very small time window (like half a minute or five minutes) and run the algorithm on top of it.
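Here is a minimal sketch of that idea: a map and reduce step applied to one small buffered window of events. The window contents and function names are hypothetical, chosen to mirror the word-counting example from earlier.

```python
from collections import Counter
from functools import reduce

def map_phase(event):
    """Map step: emit a (event_type, 1) pair for each event."""
    return (event, 1)

def reduce_phase(acc, pair):
    """Reduce step: sum the counts per event type."""
    key, count = pair
    acc[key] += count
    return acc

# One small time window of buffered events (say, 30 seconds' worth).
window = ["CLICK_EVENT", "CLICK_EVENT", "PAUSE_EVENT"]
counts = reduce(reduce_phase, map(map_phase, window), Counter())
print(counts)
```

The framework would rerun this computation for every window, emitting one result per window into the output stream.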

Key takeaways

  • Stream processing enables us to take action on data as soon as the data is available.
  • Based on business use cases, we can decide how soon we want to take those actions.