Data Scrubbing Operation: Removing Variables
Understand how to prepare datasets for machine learning by removing variables that are irrelevant or incompatible with your selected algorithm. Explore using exploratory data analysis and domain knowledge to identify which variables to drop, and learn how to execute this efficiently in Python using the del function without affecting the original data file.
We'll cover the following...
We'll cover the following...
Quick overview
Preparing data for further processing generally starts by removing variables that aren’t compatible with the chosen algorithm or variables that are deemed less relevant to your target output. Determining which variables to remove from the dataset is generally done using exploratory data ...