The Machine Learning Process
Explore the essential machine learning process that includes collecting and preparing data, building models using supervised or unsupervised learning, and evaluating model performance. Understand how iterative refinement improves models to solve real-world problems effectively with scikit-learn.
We'll cover the following...
There is a workflow that we can follow in most ML projects. This workflow consists of four major steps that each feature smaller steps:
Data collection
Preprocessing
Modeling
Model evaluation
Although these steps give us the impression of linearity, the process is usually quite iterative. This means we’re constantly going back and forth between steps.
Let’s look at these steps in more detail.
Data collection
Data collection, the initial stage of the ML process, is where raw data is gathered from various sources. This data can come in different forms, including structured and unstructured data, and it may be stored in databases, spreadsheets, text documents, or other formats. The quality and ...