...

/

Quiz: Tokens, N-Grams, tf-idf, and Stemming

Quiz: Tokens, N-Grams, tf-idf, and Stemming

Test your knowledge of document search strategies, tokenization, n-grams, stemming, and tf-idf.

We'll cover the following...

Natural Language Tools

1.

What will we get from the following command? findFreqTerms(DTmatrix, lowfreq = 400)

A.

This creates a list of terms that appear less than 400 times.

B.

This will create a list of the most frequent terms for each document.

C.

This command creates a list of terms that appear a minimum of 400 times in DTmatrix.

D.

findFreqTerms(DTmatrix, lowfreq = 400) trims DTmatrix to a list of 400 terms.


1 / 4
...