Data Input and Output: Save a Snapshot
Learn to save progress in pandas and PySpark.
We'll cover the following...
While working in our day-to-day lives, we might not be able to finish an entire project by the end of the day. It can be really tedious when we have to re-run all the code from yesterday to get started. To counter this problem, we can create a snapshot of our current work which saves us valuable time when dealing with big data.
We create separate directories for screenshots for pandas
and Pyspark
.
We use the command below to create a directory for screenshots for pandas
:
mkdir -p data/snapshot/pandas
We use the ...