Amazon Review Data (2018)
Understand the structure and scale of the 2018 Amazon review dataset including reviews, metadata, and product categories. Learn to handle large datasets by comparing Pandas and PySpark performance, preparing you to efficiently process and analyze extensive data collections.
We'll cover the following...
We'll cover the following...
Description
This dataset is an updated version of the “Amazon review dataset" released in 2014. As in the previous version, this dataset includes reviews (ratings, text, helpfulness votes), product metadata (descriptions, category information, price, brand, and image features), and links (also viewed/also bought graphs). In addition, this version provides the following features:
-
More reviews:
- The total number of reviews is 233.1 million