word_list(text.var, grouping.var = NULL, stopwords = NULL, alphabetical = FALSE, cut.n = 20, cap = TRUE, cap.list = NULL, cap.I = TRUE, rm.bracket = TRUE, char.keep = NULL, apostrophe.remove = FALSE, ...)
NULLgenerates one word list for all text. Also takes a single grouping variable or a list of 1 or more grouping variables.
TRUEthe output of frequency lists is ordered alphabetically. If
FALSEthe list is ordered by frequency rank.
TRUEcapitalizes words from the cap.list.
TRUEcapitalizes words containing the personal pronoun I.
TRUEall brackets and bracketed text are removed from analysis.
word_listshould keep. The default is to remove every symbol except apostrophes.
TRUEremoves apostrophes from the output.
An object of class
"word_list" is a list of lists of vectors
or dataframes containing the following components:
cwlcomplete word list; raw words
swlstop word list; same as rwl with stop words removed
fwlfrequency word list; a data frame of words and corresponding
fswlfrequency stopword word list; same as fwl but with stop words
rfswlreduced frequency stopword word list; same as fswl but truncated
to n rows
Transcript Apply Raw Word Lists and Frequency Counts by grouping variable(s).
## <strong>Not run</strong>: # word_list(raj.act.1$dialogue) # # out1 <- with(raj, word_list(text.var = dialogue, # grouping.var = list(person, act))) # names(out1) # lapply(out1$cwl, "[", 1:5) # # with(DATA, word_list(state, person)) # with(DATA, word_list(state, person, stopwords = Top25Words)) # with(DATA, word_list(state, person, cap = FALSE, cap.list=c("do", "we"))) # ## <strong>End(Not run)</strong>