
The Complete Pipeline

Explore how to combine core components into a complete machine learning training pipeline. Understand the use of factory design patterns for dataset and model management to simplify code extension and dependency handling. Learn to orchestrate tasks, parse arguments, log events, and track experiments in a scalable ML pipeline.

We now have the following components of our pipeline:

  • The pipeline core

    • Argument parsing

    • Artifacts and their versioning

    • Logging

  • The ML library

    • The dataset module

    • The model module

    • Report generation

We need two more pieces of code before we can run the complete pipeline. Both of these conform to the factory design pattern.

The factory design pattern

In software engineering, a design pattern is a reusable solution to a frequently encountered problem. This will become clear when we discuss the factory design pattern and how it applies to datasets and models in our pipeline.

We’ve seen the abstract base class Dataset, from which we derived IrisDataset. We can use this class directly in our code, as shown below.

```python
# Python 3.8
from ml_pipeline.datasets import iris

dataset = iris.IrisDataset("data/iris.csv")
```

But what if we have different datasets in our pipeline? Remember that our goal was to build a pipeline that can extend to other projects, so ...
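A registry-based factory is one common way to handle this. The sketch below is illustrative, not the pipeline's actual code: it defines minimal stand-ins for `Dataset` and `IrisDataset` (which really live in `ml_pipeline.datasets`), plus a hypothetical `WineDataset` and `DatasetFactory`, to show how a string name can be mapped to a concrete dataset class.

```python
# Hypothetical sketch of a dataset factory. Minimal stand-ins for the
# pipeline's Dataset classes are defined here so the example is
# self-contained and runnable.
from abc import ABC, abstractmethod


class Dataset(ABC):
    """Stand-in for the pipeline's abstract Dataset base class."""

    def __init__(self, path: str):
        self.path = path

    @abstractmethod
    def load(self):
        ...


class IrisDataset(Dataset):
    def load(self):
        return f"iris data from {self.path}"


class WineDataset(Dataset):  # hypothetical second dataset
    def load(self):
        return f"wine data from {self.path}"


class DatasetFactory:
    """Maps a string name to a Dataset subclass (factory design pattern)."""

    _registry = {
        "iris": IrisDataset,
        "wine": WineDataset,
    }

    @classmethod
    def create(cls, name: str, path: str) -> Dataset:
        try:
            dataset_cls = cls._registry[name]
        except KeyError:
            raise ValueError(f"unknown dataset: {name!r}") from None
        return dataset_cls(path)


# Calling code picks the dataset by name (e.g. from a parsed argument)
# without importing any concrete dataset class directly.
dataset = DatasetFactory.create("iris", "data/iris.csv")
```

With this shape, supporting a new dataset means writing one subclass and registering it under a name; the code that builds and runs the pipeline never changes, which is exactly the extensibility the factory pattern buys us.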