After loading the libraries, we’ll need to generate a scatter plot; we’ll merge our anxiety disorder data frame (anx) with the GDP data using pd.merge. We would do the same to merge our schizophrenia data (sch).

Plot prevalence of anxiety disorder

Plotting the graph won’t involve anything we haven’t seen before in these projects. The labels argument is used to make the plot more readable by assigning descriptive axis labels.

And here’s what came out the other end:

Python 3.5

import pandas as pd
import numpy as np
import matplotlib as plt
import matplotlib.pyplot as plt
import plotly.graph_objs as go
import plotly.express as px
gdp = pd.read_csv('WorldBank_PerCapita_GDP.csv')
anx = pd.read_csv('anxiety.csv')
anx = anx[['Country','Val']]
merged_data_anx = pd.merge(gdp, anx, on='Country')
merged_data_anx = merged_data_anx[['Country', 'Valu\
e', 'Val']]
merged_data_anx.dropna(axis=0, how='any', thresh=None,
subset=None, inplace=True)
fig = px.scatter(merged_data_anx, x="Val", y="Value\
",
    trendline="ols", log_x=True,
    labels={
          "Value": "GDP (in dollars)",
          "Val": "Prevalence of Anxiety \
Disorders (/100k)"
           },
           hover_data=["Country", "Val"])
fig.write_image("output/graph.png")

Python 3.5

import pandas as pd
import numpy as np
import matplotlib as plt
import matplotlib.pyplot as plt
import plotly.graph_objs as go
import plotly.express as px
gdp = pd.read_csv('WorldBank_PerCapita_GDP.csv')
sch = pd.read_csv('schizophrenia.csv')
sch = sch[['Country','Val']]
merged_data_sch = pd.merge(gdp, sch, on='Country')
merged_data_sch = merged_data_sch[['Country', 'Value', 'Val']]
merged_data_sch.dropna(axis=0, how='any', thresh=None, 
                   subset=None, inplace=True)
fig = px.scatter(merged_data_sch, x="Val", y="Value\
",
        trendline="ols", log_x=True,
        labels={
            "Value": "GDP (in dollars)",
            "Val": "Prevalence of Schizoph\
renia (/100k)"
        },
        hover_data=["Country", "Val"])
fig.write_image("output/graph.png")

Python 3.5

import pandas as pd
import numpy as np
import matplotlib as plt
import matplotlib.pyplot as plt
import plotly.graph_objs as go
import plotly.express as px
gdp = pd.read_csv('WorldBank_PerCapita_GDP.csv')
pan = pd.read_csv('pancreatic.csv')
pan = pan[['Country','Val']]
merged_data_pan = pd.merge(gdp, pan, on='Country')
merged_data_pan = merged_data_pan[['Country', 'Value', 'Val']]
merged_data_pan.dropna(axis=0, how='any', thresh=None, 
                   subset=None, inplace=True)
fig = px.scatter(merged_data_pan, x="Val", y="Value", 
                 trendline="ols", log_x=True,
                 labels={
                     "Value": "GDP (in dollars)",
                     "Val": "Prevalence of Pancreatic Cancer (/100k)"
                 },
                 hover_data=["Country", "Val"])
fig.write_image("output/graph.png")

1.Before We Begin

2.Comparing Wages With Consumer Price Index Data

3.Wages and CPI: Reality Check

4.Working With Major US Storm Data

Project

5.Property Rights and Economic Development

6.How Representative Is Your Government?

7.Does Wealth Influence The Prevalence Of Mental Illness?

8.Do Birthdays Make Elite Athletes?

9.Does Literacy Impact The Income of People

10.Conclusion

11.Appendix

Plotting the Data

Plot prevalence of anxiety disorder

Plot prevalence of schizophrenia

Plot prevalence pancreatic cancer

Jupyter notebook in action