Introduction

Explore how to process and manipulate numeric data using NumPy, gaining practical skills to prepare datasets for machine learning models and data analysis.

We'll cover the following...

A. Data processing
B. NumPy

A. Data processing

When asked about Google's model for success, Peter Norvig, the director of research at Google, famously stated,

"We don't have better algorithms than anyone else; we just have more data."

Though probably an understatement (given the amount of talent employed at Google), the quote does provide a sense of just how vital data is to having successful outcomes.

People normally discuss the importance of data in the context of machine learning. No matter how sophisticated a machine learning model is, it will not perform well unless it has a reasonable amount of data to train on. On the other hand, given a large and diverse set of training data, a good deep learning model will significantly outperform non-deep-learning algorithms.

However, data is not just limited to machine learning. Companies use data to identify customer trends, political parties use data to determine which demographics they should target, sports teams use data to analyze players, etc.

Example baseball data used in sabermetrics. The concept was popularized by the 2011 film, Moneyball.

The universal usage of data makes data processing, the act of converting raw data into a meaningful form, an essential skill to have.

1.What you'll learn from this course

2.Data Manipulation with NumPy

3.Data Analysis with pandas

4.Data Preprocessing with scikit-learn

5.Data Modeling with scikit-learn

6.Clustering with scikit-learn

7.Gradient Boosting with XGBoost

8.Deep Learning with TensorFlow

9.Deep Learning with Keras

Mock Interview

Introduction

A. Data processing

B. NumPy