Data Science in Production: Building Scalable Model Pipelines
Gain insights into building scalable data and model pipelines, explore different cloud environments, delve into streaming workflows, and discover essential tools for creating real-time data products.
4.6
110 Lessons
4 Cloud Labs
8h
Updated 4 months ago
Join 2.9 million developers at
Learning Roadmap
1.
Introduction to Building Scalable Model Pipelines
Get familiar with building scalable data pipelines using Python, cloud environments, and coding environments.
Course Overview
Applied Data Science
Python for Scalable Compute
Cloud Environments
Coding Environments
Quiz - Data Science Preliminary Concepts
🛑 Important Note!
Introduction to Datasets
- BigQuery to Pandas
- Kaggle to Pandas
Quiz - Importing Datasets
Prototype Models
- Linear Regression
- Logistic Regression
- Keras Regression
Automated Feature Engineering
Quiz - Prototype Modeling
Conclusion: Data science models, tools and environments
2.
Models as Web Endpoints
Solve challenges with deploying machine learning models as scalable web endpoints.
3.
Models as Serverless Functions
17 Lessons
Go hands-on with deploying machine learning models using serverless functions on AWS and GCP.
4.
Containers for Reproducible Models
9 Lessons
Grasp the fundamentals of using containers to ensure reproducible, scalable model deployments.
5.
Workflow Tools for Model Pipelines
10 Lessons
Take a closer look at managing and automating model pipelines with workflow tools like Airflow.
6.
PySpark for Batch Pipelines
25 Lessons
Follow the process of building scalable model pipelines using PySpark in cloud environments.
7.
Cloud Dataflow for Batch Modeling
8 Lessons
Master the creation and execution of scalable Cloud Dataflow pipelines for batch modeling.
8.
Streaming Model Workflows
10 Lessons
Learn how to use streaming platforms for scalable real-time machine learning pipelines.
Certificate of Completion
Showcase your accomplishment by sharing your certificate of completion.
Developed by MAANG Engineers
ABOUT THIS COURSE
The goal of this course is to provide you with a set of tools that can be used to build predictive model services for product teams.
In this course, you’ll start by covering the different cloud environments and tools for building scalable data and model pipelines. You’ll then learn about the datasets and types of models that are used heavily in everyday production. Throughout the course, you’ll have plenty of exercises and challenges to get you comfortable working with the diverse toolset.
Lastly, you’ll explore streaming model workflows, which are crucial for building real-time data pipelines that move data between different components in a cloud environment.
After working through this course, you will have gained valuable hands-on experience with many of the tools needed to build data products. You will also have a better understanding of how to build scalable machine learning pipelines in a cloud environment.
ABOUT THE AUTHOR
Ben Weber
Data Science at Zynga, UCSC Alum
Trusted by 2.9 million developers working at companies