Challenge: Data Transformation

Let's solve programming challenges related to data transformation in PySpark.

We'll cover the following

Task

Perform summary statistics on the review_text and vote columns.

Steps

  1. Impute NaN values.
  2. Show statistics of vote.
  3. Show statistics of review_text.

Get hands-on with 1200+ tech skills courses.