About the Course

Get a brief introduction to what you’ll learn in this course.

Target audience

The main audience for this course is Java developers, so an intermediate knowledge of Java is necessary. Familiarity with object-oriented programming (OOP) is also recommended. If you are new to OOP, the course introduces the relevant concepts with examples, and the relevant parts of the code are explained in detail.

Knowledge of big data and batch processing patterns is not a requirement for this course. We cover them through theory, illustrations, and code examples.

If you’re enthusiastic about learning one of the most popular frameworks in the market for big data processing, are here to consolidate your experience, or simply want to acquire a much-valued skill in the information era, you are in the right place!

Course layout

The main goal of this course is to present a simple framework to learn both the theory and the practice that complements it.

To start off on the right foot and build a solid foundation in Spark, this course is structured in the following way:

  • Chapters 2 and 3 focus on the theory and general definitions surrounding Spark. They cover how Spark projects are usually structured in Java, alongside the tools that may be needed to develop them in the real world. The first two Spark code examples presented in these chapters serve as a gentle introduction to the API.

  • Chapters 4, 5, and 6 expand on the DataFrame data structure, an abstraction for working with datasets, and follow a practical coding approach. These chapters include explanations and complementary theory to offer a complete understanding of the topics at hand.

  • Chapter 7 is all about developing a comprehensive template for a Spark-based big data batch processing application. The goal of this chapter is to provide developers with an initial view of what a real-world application looks like.

  • Chapters 8 and 9 concern Spark deployment, execution, and monitoring.

  • Chapter 10 offers information on how to keep building knowledge about Spark.

Note: Most of the lessons follow a similar structure. An initial conceptual part serves as an introduction to a topic, including diagrams and graphics, followed by hands-on coding widgets that are explained through a detailed code walkthrough.