Storing and Protecting Data
Understand how to store data effectively using CSV, JSON, and Jupyter notebook files in a Python analytics environment. Learn the importance of protecting your data through reliable backups and safeguarding your files against loss or corruption.
We'll cover the following...
Storing of the data
We’ll generate data in CSV and JSON files while collecting and cleaning data sets. We’ll also store our Python scripts in .ipynb notebook files. Thankfully, Jupyter will regularly autosave your notebooks. These files are live in a Docker container, as shown in the following figure.
Protecting the data
If the host is shut down for the last time and decommissioned (or corrupted beyond repair), our CSV and .ipynb files go with it. To preserve all
that data, make sure up-to-date copies exist in reliable places.
After all, data doesn’t back itself up.
Where are the live CSV and JSON files?