Stemming with tidytext
Learn how tidytext uses SnowballC and Hunspell to accomplish stemming.
We'll cover the following...
tidytext
relies on other packages for stemming:
Stemming with SnowballC
The tidytext
package doesn’t have specific stemming functions and instead relies on SnowballC and standard tidyverse
commands.
The SnowballC package in R is an interface to the Snowball stemming library, which is a collection of algorithms for various languages. These algorithms were developed by Martin Porter and are widely used in natural language processing tasks.
SnowballC includes functions such as wordStem()
, which takes a word as input ...