Measures of Variability

Explore key statistical measures of variability to understand how data is dispersed around the mean. Learn to calculate variance, standard deviation, range, quartiles, percentiles, and the five-number summary to analyze dataset spread effectively.

We'll cover the following...

Measures of Variability or Spread

Measures of Variability or Spread

Measures of Variability also known as the measure of spread shows us the dispersion in the dataset and how the data is distributed around the center (Measure of Location) of the dataset. The most commonly used Measures of Variability are discussed below.

Variance

The Variance is the expected value (mean) of the squared differences of the data values from the mean. It shows us how close or far the values in a dataset are from the mean of the dataset in squared units.

Formula

$s^2={\frac{1}{n-1}\sum_{i=1}^n(x_i-\bar{x})^2}$

$s^2$ is the variance.
$n$ is the total number of values in the dataset
$\sum_{i=1}^n$ is the sum of the values from 1 to n.
$(x_i-\bar{x})^2$ is the square of the difference of each value in the dataset from the mean.
$\sum_{i=1}^n(x_i-\bar{x})^2$ is the sum of all the squared difference of values from the mean.

Example

Lets say we have a list of numbers as 34, 56, 190, 10000, and 45.
Here n = 5 (Number of Values)
The mean of the above list of numbers is calculated as

$\bar{x}=\frac{34 + 56 + 190 + 10000 + 45}{5}=\frac{10325}{5}=2065$

The calculations are done below.

x	x-x̄	(x-x̄) $^2$
34	-2031	4124961
56	-2009	4036081

...

1.What is Data Science ?

2.Applications of Data Science

3.Overview of Libraries

4.Probability and Statistics

5.Machine Learning Part-1

6.Machine Learning Part-2

7.Machine Learning Part-3

8.Deep Learning

9.Machine Learning Tools and Libraries

10.Big Data Tools and Technologies

11.Where to go next ?

Mock Interview