Introduction
Explore core data extraction techniques from relational and non-relational databases, APIs, web scraping, and cloud repositories. Understand different data formats and how to handle structured, semi-structured, and unstructured data. This lesson prepares you to build versatile ETL pipelines by mastering extraction from diverse data sources.
We'll cover the following...
In today's world, data is everywhere. It’s constantly generated and stored in various sources. To effectively transfer data, it’s essential to first learn how to extract it.
This section introduces a range of techniques for extracting data from diverse sources such as relational and non-relational databases, cloud data warehouses, APIs, web scraping, and more. These skills will help us build ETL pipelines capable of extracting data from various sources in different formats and for multiple purposes.
Data sources
The method for extracting data from each source varies depending on the specific characteristics of the data, the source itself, and the purpose for which we extract the data. ...