Log In
0% completed
All Lessons
Free Lessons (3)
Introduction
Getting Started
ETL Pipeline Stages
What Is an ETL Pipeline?
A New Paradigm—ELT
ETL Example—Extraction
ETL Transformation Example: Addressing Data Quality Issue
ETL Transformation Example: Handling Missing Values and Data
ETL Transformation Example: Sorting and Finalizing the Data
ETL Example—Load
ETL Example—Scheduling
Batch vs. Stream Processing
Data Warehouse
Examples and Use Cases
Quiz: ETL Pipelines
E: Extract
Introduction
Data Extraction Methods Overview
Extracting Data with Web Scraping
Web Scraping Exercise: Reading the Data
Web Scraping Exercise: DataFrames to CSV
Extraction Using a REST API
Exercise: Extracting Data with a REST API
Full Extraction From MySQL Database
Incremental Extraction From MySQL Database
Extraction From MySQL’s Binary Log
Extract From PostgreSQL Database
Extraction From Google BigQuery
Exercise: Databases
Quiz: Extracting Data
T: Transform
Introduction
Data Typing and Structuring Using Python
Exercise: Data Typing and Structuring
Anonymizing and Encrypting Using Python
Exercise: Anonymizing and Encrypting the Data
Data Cleaning Using Apache Spark: Missing Data
Data Cleaning Using Apache Spark: Duplicate Data
Exercise: Data Cleaning
Filtering, Sorting, Aggregating, and Binning Using SQL
Exercise: Filtering, Sorting, Aggregating, and Binning
Quiz: Transforming Data
L: Load
Introduction
Hosting and Deployment: On-Premise vs. Cloud-Based
Hosting and Deployment: Open-Sourced vs. Proprietary
Loading Methods: Scheduled vs. On-Demand
Loading Methods: Full Loading vs. Incremental Loading
Loading Data Using the COPY Command
Exercise: The COPY Command
Exercise: Incremental Load
Quiz: Loading Data
Orchestration
Introduction
Deploying Airflow
ETL Pipeline Exercise: Extracting Data
ETL Pipeline Example: Airflow Extraction Task
ETL Pipeline Exercise: Transform
ETL Pipeline Example: Airflow Transform Task
ETL Pipeline Example: Load
Interacting with Airflow
Mini Project
ETL Pipeline: Fraud Detection Preprocessing
Conclusion
Final Thoughts
Project
Premium
Build a News ETL Data Pipeline Using Python and SQLite
Mock interview
Premium
Fraud Detection System