Quiz: Tokens, N-Grams, tf-idf, and Stemming
Test your knowledge of document search strategies, tokenization, n-grams, stemming, and tf-idf.
Natural Language Tools
1
What will we get from the following command? findFreqTerms(DTmatrix, lowfreq = 400)
A)
This creates a list of terms that appear less than 400 times.
B)
This will create a list of the most frequent terms for each document.
C)
This command creates a list of terms that appear a minimum of 400 times in DTmatrix
.
D)
findFreqTerms(DTmatrix, lowfreq = 400)
trims DTmatrix
to a list of 400 terms.
Question 1 of 40 attempted
Get hands-on with 1200+ tech skills courses.