Quiz: Tokens, N-Grams, tf-idf, and Stemming
Test your knowledge of document search strategies, tokenization, n-grams, stemming, and tf-idf.
We'll cover the following...
We'll cover the following...
Natural Language Tools
1.
What will we get from the following command? findFreqTerms(DTmatrix, lowfreq = 400)
A.
This creates a list of terms that appear less than 400 times.
B.
This will create a list of the most frequent terms for each document.
C.
This command creates a list of terms that appear a minimum of 400 times in DTmatrix
.
D.
findFreqTerms(DTmatrix, lowfreq = 400)
trims DTmatrix
to a list of 400 terms.
1 / 4
...