This course includes
Course Overview
This course explores the big data ecosystem, focusing on hands-on utilization of PySpark—the Python API for Apache Spark. In this course, you’ll experience a balanced blend of theory and practice. You’ll learn about data ingestion, storage, distributed computing, PySpark’s intricacies, data processing, data analysis, performance optimization, tool integration, and practical applications like machine learning. This course, suited for beginners to intermediate learners, will give you an understanding of b...
TAKEAWAY SKILLS
Python 3
What You'll Learn
An understanding of the big data ecosystem, including data ingestion, integration methods, and big data storage options
A working knowledge of distributed computing fundamentals, covering parallel processing, partitioning strategies, and load balancing methodologies
The ability to utilize PySpark for diverse data operations, including processing, transformation, and analysis
Familiarity with basic and advanced data types, Spark SQL, machine learning algorithms, and data mining within PySpark
A working knowledge of PySpark's integration capabilities with various big data tools, such as Hadoop, Kafka, Hive, and others
What You'll Learn
An understanding of the big data ecosystem, including data ingestion, integration methods, and big data storage options
Show more
Course Content
Introduction to the Course
Introduction to Big Data
Exploring PySpark Core and RDDs
PySpark DataFrames and SQL
Customer Churn Analysis Using PySpark
Machine Learning with PySpark
6 Lessons
Modeling with PySpark MLlib
5 Lessons
Predicting Diabetes in Patients Using PySpark MLlib
3 Lessons
Performance Optimization in PySpark
5 Lessons
PySpark Optimization: Analyzing NYC Restaurants Data
3 Lessons
Integrating PySpark with Other Big Data Tools
4 Lessons
Wrap Up
1 Lesson
Course Author
Trusted by 1.4 million developers working at companies
Anthony Walker
@_webarchitect_
Emma Bostian 🐞
@EmmaBostian
Evan Dunbar
ML Engineer
Carlos Matias La Borde
Software Developer
Souvik Kundu
Front-end Developer
Vinay Krishnaiah
Software Developer
Eric Downs
Musician/Entrepeneur
Kenan Eyvazov
DevOps Engineer
Souvik Kundu
Front-end Developer
Eric Downs
Musician/Entrepeneur
Anthony Walker
@_webarchitect_
Emma Bostian 🐞
@EmmaBostian
See how Educative uses AI to make your learning more immersive than ever before.
Instant Code Feedback
AI-Powered Mock Interviews
Adaptive Learning
Explain with AI
AI Code Mentor