Stem Text

Usage

stemmer(text.var, rm.bracket = TRUE, capitalize = TRUE, warn = TRUE, char.keep = "~~", ...)
stem_words(...)
stem2df(dataframe, text.var, stem.name = NULL, ...)

Arguments

text.var
The text variable. In stemmer this is a vector text string. For stem2df this is a character vector of length one naming the text column.
rm.bracket
logical. If TRUE brackets are removed from the text.
capitalize
logical. If TRUE selected terms are capitalized.
warn
logical. If TRUE warns about rows not ending with standard qdap punctuation endmarks.
char.keep
A character vector of symbols that should be kept within sentences.
dataframe
A dataframe object.
stem.name
A character vector of length one for the stemmed column. If NULL defaults to "stem.text".
...
Various: stemmer - Other arguments passed to capitalizer stem_words - Words or terms. stem2df - Other arguments passed to stemmer

Stem Text

Value

stemmer - returns a character vector with stemmed text.

stem_words - returns a vector of individually stemmed words.

stem2df - returns a dataframe with a character vector with stemmed text.

Description

stemmer - Stems a vector of text strings (A wrapper for the pkgtm package's stemDocument.

stem_words - Wrapper for stemmer that stems a vector of words.

stem2df - Wrapper for stemmer that stems a vector of text strings and returns a dataframe with the vector added..

Examples

## <strong>Not run</strong>: # #stemmer EXAMPLE: # stemmer(DATA$state) # out1 <- stemmer(raj$dialogue) # htruncdf(out1, 20, 60) # # #stem_words EXAMPLE: # stem_words(doggies, jumping, swims) # # #stem2df EXAMPLE: # out2 <- stem2df(DATA, "state", "new") # truncdf(out2, 30) # ## <strong>End(Not run)</strong>

See also

capitalizer