Top reviews of 2017

Learn how to filter data in Pandas and PySpark.

Filter top reviews of 2017 in Pandas

We could determine top reviews based on the number of votes a review has received—for example, we could say 20 votes is a top review. However, a much better method would be to take the number of votes for all reviews into account, and use quantile or percentile. We’ll consider a review as a top review only if the number of votes is higher than a particular percentile (75% for example).

Get hands-on with 1200+ tech skills courses.