Data Engineering Foundations in Python

0%
1.Getting Started
Introduction
2.Data Team Structure
Data Team RolesData Team Structure: Embedded or Centralized?Quiz: Data Team Structure
3.Data Engineering Life Cycle
Set Up the Google CloudIntroduction to Data Engineering Life CycleIngestion, Transformation, and VisualizationStorage and InfrastructureQuiz: Data Engineering Life Cycle
4.Cloud Data Architecture
Evolution of Cloud Data ArchitectureCloud Service ModelsCloud Data Architectures: Lambda, Kappa, and Data WarehouseCloud Data Architectures: Data Lake and Data MeshPatterns of Good Cloud Data ArchitectureQuiz: Cloud Data Architecture
5.Data Ingestion
Batch IngestionStream IngestionPush vs. PullIngestion Methods—SFTP, API, and Object StorageIngestion Methods—CDCIngestion Methods—Streaming PlatformIngest with pandasIngest with PySparkQuiz: Data Ingestion
6.Data Modeling
Introduction to Data ModelingKimball’s Dimensional ModelingSteps of Dimensional ModelingDDL vs. DMLQuery Life CycleChallenge: BigQuery SQLSolution: BigQuery SQLQuiz: Data Modeling
7.Data Orchestration
What Is Data Orchestration?Airflow BasicsAirflow DAG DesignChallenge: Build an FXRate Reporting Pipeline with AirflowSolution: Build an FXRate Reporting Pipeline with AirflowOrchestration Tool: DagsterOrchestration Tool: dbtQuiz: Data Orchestration
Project
Premium
Mastering Airflow: Building an ETL Pipeline
8.Data Quality
Data Quality MeasurementData Schema: Avro and ProtobufManage Data Quality in dbtQuiz: Data Quality
Mini Project
Premium
Build an End-to-End Data Pipeline for Formula 1 Analysis
9.Epilogue
The Way Forward
10.Appendix
Additional Resources
Mock Interview
Premium
Data Engineering Fundamentals