This device is not compatible.
You will learn to:
Build a complete machine learning pipeline from data generation to deployment.
Preprocess and balance datasets using scaling techniques and SMOTE.
Train and evaluate a Random Forest model for churn prediction.
Deploy a machine learning model using FastAPI for real-time inference.
Design a production-ready API that returns business actionable insights.
Skills
Data Engineering
Machine Learning
API Development
Prerequisites
Basic knowledge of Python
Programming familiarity with pandas and NumPy
Understanding of machine learning fundamentals
Basic knowledge of classification models
Technologies
Python
FastAPI
Project Description
In this project, we’ll build a churn prevention system that uses machine learning to identify customers likely to churn. Customer churn is a major challenge for subscription-based businesses, and we address it by designing an end-to-end pipeline that turns raw customer behavior data into retention signals teams can act on. We’ll use Python data science tools, including pandas, scikit-learn, SMOTE, and FastAPI, to build a machine learning workflow that covers data preparation, model training, evaluation, and deployment.
We’ll start by generating and preprocessing customer data, where we clean features, scale numerical values, and handle class imbalance to improve model learning. We then train a Random Forest classifier to detect churn patterns and evaluate its performance using standard classification metrics. Finally, we’ll deploy the trained model using FastAPI, allowing real-time predictions through a REST API that returns both churn probability and business-driven retention strategies. This project gives us hands-on experience in building production-ready machine learning systems that directly support data-driven decision-making.
Project Tasks
1
Data Engineering and Preprocessing
Task 0: Get Started
Task 1: Import Libraries and Modules
Task 2: Load and Prepare the Baseline Churn Data
Task 3: Implement Standardization and Handle Class Imbalance
2
Model Development and Evaluation
Task 4: Train and Evaluate the Random Forest Ensemble
Task 5: Serialize the Model and Preprocess Assets
3
API Deployment
Task 6: Deploy Production-Ready Inference AP
Task 7: Finalize and Export the Model
Congratulations!
Subscribe to project updates
Atabek BEKENOV
Senior Software Engineer
Pradip Pariyar
Senior Software Engineer
Renzo Scriber
Senior Software Engineer
Vasiliki Nikolaidi
Senior Software Engineer
Juan Carlos Valerio Arrieta
Senior Software Engineer
Relevant Courses
Use the following content to review prerequisites or explore specific concepts in detail.