Differences In Word Use Between Groups

Usage

word_diff_list(text.var, grouping.var, vs.all = FALSE, vs.all.cut = 1, stopwords = NULL, alphabetical = FALSE, digits = 2)

Arguments

text.var
The text variable.
grouping.var
The grouping variables. Default NULL generates one word list for all text. Also takes a single grouping variable or a list of 1 or more grouping variables.
vs.all
logical. If TRUE looks at each grouping variable against all others ("a" vs. all comparison). If FALSE looks at each "a" vs. "b", comparison (e.g., for groups "a", "b", and "c"; "a" vs. "b", "a" vs. "c" and "b" vs. "c" will be considered).
vs.all.cut
Controls the number of other groups that may share a word (default is 1).
stopwords
A vector of stop words to remove.
alphabetical
logical. If TRUE orders the word lists alphabetized by word. If FALSE order first by frequency and then by word.
digits
the number of digits to be displayed in the proportion column (default is 3).

Differences In Word Use Between Groups

Value

An list of word data frames comparing grouping variables word use against one another. Each dataframe contains three columns: wordThe words unique to that group freqThe number of times that group used that word propThe proportion of that group's overall word use dedicated to that particular word

Description

Look at the differences in word uses between grouping variable(s). Look at all possible "a" vs. "b" combinations or "a" vs. all others.

Examples

## <strong>Not run</strong>: # out1 <- with(DATA, word_diff_list(text.var = state, # grouping.var = list(sex, adult))) # lapply(unlist(out1, recursive = FALSE), head, n=3) # # out2 <- with(DATA, word_diff_list(state, person)) # lapply(unlist(out2, recursive = FALSE), head, n=3) # # out3 <- with(DATA, word_diff_list(state, grouping.var = list(sex, adult), # vs.all=TRUE, vs.all.cut=2)) # # # out4 <- with(mraja1, word_diff_list(text.var = dialogue, # grouping.var = list(mraja1$sex, mraja1$fam.aff))) # # # out5 <- word_diff_list(mraja1$dialogue, mraja1$person) # # out6 <- word_diff_list(mraja1$dialogue, mraja1$fam.aff, stopwords = Top25Words) # # out7 <- word_diff_list(mraja1$dialogue, mraja1$fam.aff, vs.all=TRUE, vs.all.cut=2) # lapply(out7, head, n=3) # ## <strong>End(Not run)</strong>