Handling Overplotting and Outlier Values

Explore methods for handling overplotting and outlier values in scatterplots using Plotly and Dash. Learn to adjust marker opacity, size, symbols, and apply logarithmic scales to improve chart readability and data interpretation.

We'll cover the following:

Controlling the opacity and size of markers
Using logarithmic scales

Suppose we now want to see the relationship between our variable and population for the same year we have been working with. We want Population, total on the $x$ axis and perc_pov_19 on the $y$ axis.

We first create a subset of poverty in which year is equal to 2010 and is_country is True, and sort the values using Population, total:

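# Keep only rows for actual countries in 2010, then sort by population.
# Parentheses let the method chain span several lines.
df = (
    poverty[poverty['year'].eq(2010) & poverty['is_country']]
    .sort_values('Population, total')
)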
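To see what the filter and sort do, here is a minimal sketch on a toy stand-in for poverty. The rows and the Country Name column are invented for illustration and are not taken from the real dataset; only the year, is_country, and Population, total columns come from the text above.

```python
import pandas as pd

# Toy stand-in for the poverty DataFrame; all values are made up.
poverty = pd.DataFrame({
    'Country Name': ['A', 'B', 'World', 'C', 'A'],  # hypothetical column
    'year': [2010, 2010, 2010, 2010, 2011],
    'is_country': [True, True, False, True, True],
    'Population, total': [5_000_000, 1_200_000, 6_900_000_000,
                          300_000, 5_100_000],
})

# Combine the two conditions element-wise with &.
mask = poverty['year'].eq(2010) & poverty['is_country']

# Select the matching rows and sort them by population (ascending).
df = poverty[mask].sort_values('Population, total')

print(df['Country Name'].tolist())
```

The aggregate row ("World") and the 2011 row are dropped, and the remaining rows come out ordered from the smallest to the largest population.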

Now let’s see how to plot those two variables. Here is the code:

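import plotly.express as px

# perc_pov_19 is a variable holding the column name, so it is not quoted.
px.scatter(df,
           x='Population, total',
           y=perc_pov_19,
           title=' - '.join([perc_pov_19, '2010']),
           height=500)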
