Raw Word Lists/Frequency Counts

Usage

word_list(text.var, grouping.var = NULL, stopwords = NULL, alphabetical = FALSE, cut.n = 20, cap = TRUE, cap.list = NULL, cap.I = TRUE, rm.bracket = TRUE, char.keep = NULL, apostrophe.remove = FALSE, ...)

Arguments

text.var
The text variable.
grouping.var
The grouping variables. Default NULL generates one word list for all text. Also takes a single grouping variable or a list of 1 or more grouping variables.
stopwords
A vector of stop words to remove.
alphabetical
If TRUE the output of frequency lists is ordered alphabetically. If FALSE the list is ordered by frequency rank.
cut.n
Cut off point for reduced frequency stop word list (rfswl).
cap
logical. If TRUE capitalizes words from the cap.list.
cap.list
Vector of words to capitalize.
cap.I
logical. If TRUE capitalizes words containing the personal pronoun I.
rm.bracket
logical If TRUE all brackets and bracketed text are removed from analysis.
char.keep
A character vector of symbols (i.e., punctuation) that word_list should keep. The default is to remove every symbol except apostrophes.
apostrophe.remove
logical. If TRUE removes apostrophes from the output.
...
Other arguments passed to strip.

Raw Word Lists/Frequency Counts

Value

An object of class "word_list" is a list of lists of vectors or dataframes containing the following components: cwlcomplete word list; raw words swlstop word list; same as rwl with stop words removed fwlfrequency word list; a data frame of words and corresponding frequency counts fswlfrequency stopword word list; same as fwl but with stop words removed rfswlreduced frequency stopword word list; same as fswl but truncated to n rows

Description

Transcript Apply Raw Word Lists and Frequency Counts by grouping variable(s).

Examples

## <strong>Not run</strong>: # word_list(raj.act.1$dialogue) # # out1 <- with(raj, word_list(text.var = dialogue, # grouping.var = list(person, act))) # names(out1) # lapply(out1$cwl, "[", 1:5) # # with(DATA, word_list(state, person)) # with(DATA, word_list(state, person, stopwords = Top25Words)) # with(DATA, word_list(state, person, cap = FALSE, cap.list=c("do", "we"))) # ## <strong>End(Not run)</strong>