Data Science in Production: Building Scalable Model Pipelines

Gain insights into building scalable data and model pipelines, explore different cloud environments, delve into streaming workflows, and discover essential tools for creating real-time data products.

4.6
110 Lessons
4 Cloud Labs
8h
Updated 1 month ago
Join 3 million developers at
LEARNING OBJECTIVES
  • Identify cloud tools (AWS/GCP) to build scalable batch and streaming model workflows
  • Set up remote Jupyter on AWS EC2 with SSH and firewall rules for cloud development
  • Load and automate dataset ingestion into Pandas from BigQuery and Kaggle using Python APIs
  • Build and evaluate prototype models using scikit-learn, PySpark MLlib, and Keras (ROC/AUC, errors)
  • Deploy models as web endpoints using Flask, Gunicorn, Heroku, and Dash for real-time predictions
  • Implement serverless model endpoints with GCP Cloud Functions and AWS Lambda using S3/GCS and IAM
  • Construct batch and streaming pipelines with PySpark, Apache Beam/Dataflow, Kafka, and PubSub

Learning Roadmap

110 Lessons, 16 Quizzes, 4 Cloud Labs

3. Models as Serverless Functions

17 Lessons

Go hands-on with deploying machine learning models using serverless functions on AWS and GCP.

4. Containers for Reproducible Models

9 Lessons

Grasp the fundamentals of using containers to ensure reproducible, scalable model deployments.

5. Workflow Tools for Model Pipelines

10 Lessons

Take a closer look at managing and automating model pipelines with workflow tools like Airflow.

6. PySpark for Batch Pipelines

25 Lessons

Follow the process of building scalable model pipelines using PySpark in cloud environments.

7. Cloud Dataflow for Batch Modeling

8 Lessons

Master the creation and execution of scalable Cloud Dataflow pipelines for batch modeling.

8. Streaming Model Workflows

10 Lessons

Learn how to use streaming platforms for scalable real-time machine learning pipelines.
Certificate of Completion
Showcase your accomplishment by sharing your certificate of completion.
Developed by MAANG Engineers
ABOUT THIS COURSE
The goal of this course is to provide you with a set of tools for building predictive model services for product teams. You’ll start by covering the different cloud environments and tools for building scalable data and model pipelines. You’ll then learn about the data sets and types of models that are used heavily in everyday production. Throughout the course, exercises and challenges will get you comfortable working with this diverse toolset. Lastly, you’ll explore streaming model workflows, which are crucial for building real-time data pipelines that move data between different components in a cloud environment. After working through this course, you will have gained hands-on experience with many of the tools needed to build data products, and you will have a better understanding of how to build scalable machine learning pipelines in a cloud environment.
ABOUT THE AUTHOR

Ben Weber

Data Science at Zynga, UCSC Alum

Trusted by 3 million developers working at companies

Built for 10x Developers

No Passive Learning
Learn by building with project-based lessons and an in-browser code editor
Learn by Doing
Personalized Roadmaps
The platform adapts to your strengths and skill gaps as you go
Future-proof Your Career
Get hands-on with in-demand skills
AI Code Mentor
Write better code with AI feedback, smart debugging, and "Ask AI"
MAANG+ Interview Prep
AI Mock Interviews simulate every technical loop at top companies

FOR TEAMS

Interested in this course for your business or team?

Unlock this course (and 1,000+ more) for your entire org with DevPath