Compare Total Review of 2016 and 2017
Explore how to aggregate and compare total reviews for 2016 and 2017 by month using both Pandas and PySpark. Understand data filtering, joining techniques, and differences in syntax and functionality between these libraries to effectively transform and analyze review data.
We'll cover the following...
We'll cover the following...
Comparison in Pandas
To compare the total reviews of 2016 and 2017, we first need to aggregate the data by the review year and month. Next, we need to count the number of asin for each month. Then we can subset the new DataFrame with a filter to create new, separate DataFrames for 2016 and 2017. ...