Introduction to Distributed Systems for Dummies/

...

Batch Processing

Get to know the significance of data processing in distributed systems.

We'll cover the following...

Processing massive amounts of data
What is batch processing?
An example of batch processing
Key takeaways

Modern systems are data driven more than ever. It is so obvious that we sometimes fail to notice.

Basically, every interaction you have with a system is driven by data. So, how do we process such large amounts of data?

Processing massive amounts of data

During the last decade, access to the internet has become a norm. Today, almost 60% of the world’s population has access to the internet. This is why we have witnessed a surge in online businesses all over the globe.

Many companies have started to see a larger numbers of users in their systems. And when a system has a massive user base, all these users collectively produce a massive amount of data.

But in the past, it was impossible to leverage all the data in a meaningful way. Think of user interaction on an app. A fairly large system with millions of active users every day can dig up very useful insight from user interaction. But due to the ...

Introduction

What Distributed Systems Achieve for Us

Data in Distributed Systems

Communication Between Nodes

Data Processing in Large Scale

Distributed System Architectural Patterns

Case Study 1: Apache Spark

Case Study 2: Apache Druid

Conclusion

Batch Processing

Processing massive amounts of data