Numeric Univariate Analysis
Learn what a numeric variable is and how to undertake visualization of these common data types.
Numeric variables
A numeric variable is a type of data in statistics that represents numerical values. It can be either continuous (representing an infinite number of possible values, such as arm length) or discrete (representing a limited number of possible countable values, such as the number of pets a household has).
Univariate analysis and numeric variables
Numeric univariate analysis is a statistical method that focuses on analyzing the distribution and characteristics of a single numerical variable in a dataset. From here, we can understand its spread, central tendency, and whether any outliers exist.
Popular numeric univariate analysis techniques include boxplots, histograms, violin plots, and Empirical Cumulative Distribution Function (ECDF) plots.
Using the .describe()
DataFrame method in pandas is another great way of analyzing a single numerical variable at a time, giving us popular summary statistics (e.g., mean, median, standard deviation).
We will use a 2023 golfer dataset detailing the average hitting (driving) distances of professional golfers.
Get hands-on with 1400+ tech skills courses.