Search⌘ K

US Hurricane Data: 1851–2019

Explore techniques to clean and preprocess historical US hurricane data by filtering rows, handling missing values, and converting data types. Understand how to prepare the dataset for analysis and visualization in Python using Pandas and Jupyter Notebook.

Cleaning up the hurricane data

If you quickly look through the webpage itself, you’ll see some formatting that’ll need cleaning up. Each decade is introduced with a single row containing nothing but a string looking like “1850s” and so on… We’ll want to drop those rows. Years with no events have the string “None” in the second column. Those, too, will need to go.

Some events have no data for their “Max Wind (kt)” speeds. Instead of a number (measured in knots), the speed values for those events are represented by five dashes ...