Search⌘ K
AI Features

Data Preprocessing: Identifiers

Explore the role of identifiers in data preprocessing for binary classification. Understand how unique identifiers like passenger IDs and names can cause overfitting. Learn to detect these columns and remove them to improve your model's ability to generalize to new data.

Identifiers

The goal of machine learning is to create an algorithm that can predict data. Or, as we said before: to put a label on a thing. While we use already labeled data when building the algorithm, the goal is to predict labels we don’t know yet.

We don’t tell the algorithm how it can decide which label to select. Instead, we give the algorithm the data and let it figure it out. That being said, an intelligent ...