ETL Transformation Example: Handling Missing Values and Data
Explore practical ETL data transformation techniques focused on handling missing and null values. Learn how to delete rows with nulls, replace missing numeric values, and clean empty rows using shell scripts to improve data quality in ETL pipelines.
We'll cover the following...
We'll cover the following...
Task 3: Delete rows with null values
As we might’ve noticed, some rows had null values. According to the data scientist, all rows with null values must be dropped.
We’ve created a file called transform_data_3_null.sh to perform these operations. Let’s walk through the code line by line.
The script contains two functions,
extract_nulls()anddelete_nulls().Line 8: We create an empty text file called
null_values.txt.Line 11: We append rows with null values from ...