Quiz: Tokens, N-Grams, tf-idf, and Stemming

Test your knowledge of document search strategies, tokenization, n-grams, stemming, and tf-idf.

Natural Language Tools

1. What does the following command return?

findFreqTerms(DTmatrix, lowfreq = 400)

A) It creates a list of terms that appear fewer than 400 times in DTmatrix.

B) It creates a list of the most frequent terms for each document.

C) It creates a list of terms that appear a minimum of 400 times in DTmatrix.

D) It trims DTmatrix to a list of 400 terms.
