Feature Scaling
In this lesson, we'll uncover the details surrounding Feature Scaling, which is an important step before training the models because it speeds up the computations while building the model.
We'll cover the following
Feature scaling
Feature scaling comes under Feature Engineering. Feature scaling refers to the process of normalizing the features, or columns, or dimensions. Many Machine Learning algorithms are sensitive to the scale or magnitude of the features.
Benefits
It has the following benefits

It helps in gradient descent based algorithms to converge faster.

It helps in distancebased algorithms to give equal weight to each feature while computing the similarity.

It helps to compare the Feature Importance.
DistanceBased Algorithms take into account the distance or similarity between instances of the dataset to do the computation.
Types of feature scaling
There are these two famous types of feature scaling.
 Normalization: It involves rescaling the values of features in the range between 0 and 1. Normalization is a good technique to use if someone doesn’t know the distribution of the input columns or the distribution is not Gaussian. The formula for it is:
$X' = \frac{X  X_{min}}{X_{max}  X_{min}}$

$X'$ is the new rescaled value formed from $X$.

$X_{min}$ is the minimum value in the corresponding feature.

$X_{max}$ is the maximum value in the corresponding feature.

The above formula maps the values $X$ in the range of 0 and 1.
 Standardization: Values in the features are standardized in such a way to have a mean of 0 and a standard deviation of 1. Standardization assumes that the values in the column are normally distributed. This condition doesn’t have to be strictly true, but if the distribution is normal then it is good. The formula for it is:
$X' = \frac{X  \mu}{\sigma}$

$X'$ is the new rescaled value formed from $X$.

$\mu$ is the mean of the relevant feature.

$\sigma$ is the standard deviation of the relevant feature.
Get handson with 1200+ tech skills courses.