qdap (Quantitative Discourse Analysis Package) is an R package designed to assist in quantitative discourse analysis. The package stands as a bridge between qualitative transcripts of dialogue and statistical analysis & visualization.
If there is a discrepancy between the R and Java architectures you will have to download the appropriate version of Java compatible with the version of R you're using. For more see Tal Galili's blog post regarding rJava issues.
Download the development version of qdap here
A function to generate a project template of folders, scripts and documents.
new_project
Functions for importing data and exporting output.
condense
dir_map
mcsv_r
mcsv_w
read.transcript
Function to clean and parse text data.
bracketX
bracketXtract
genX
genXtract
beg2char
char2end
capitalizer
check_spelling
which_misspelled
check_spelling_interactive
correct
check_text
clean
comma_spacer
incomplete_replace
incomp
multigsub
mgsub
sub_holder
name2sex
potential_NA
qprep
replace_abbreviation
replace_contraction
replace_number
replace_ordinal
replace_symbol
rm_row
rm_empty_row
rm_stopwords
rm_stop
%sw%
scrubber
space_fill
spaste
stemmer
stem_words
stem2df
Trim
Functions to aid data viewing.
htruncdf
ltruncdf
lview
qview
truncdf
left_just
right_just
Search
boolean_search
%bs%
strWrap
Functions chain together qdap data and functions.
Functions to aid in selection of data elements.
counts
preprocessed
proportions
scores
visual
Functions to reshape data.
adjacency_matrix
adjmat
colSplit
colsplit2df
lcolsplit2df
colcomb2class
gantt
plot_gantt_base
gantt_rep
key_merge
paste2
colpaste2df
prop
qcombine
sentSplit
sent_detect
TOT
sentCombine
speakerSplit
trans_context
Functions for working with dialogue at the word level.
all_words
bag_o_words
unbag
breaker
word_split
chunker
common
exclude
exclude.DocumentTermMatrix
%ex%
exclude.TermDocumentMatrix
exclude.default
exclude.list
exclude.wfm
freq_terms
ngrams
strip
synonyms
syn
synonyms_frame
syn_frame
word_associate
word_diff_list
word_list
cm functions are code matrix functions. These functions are used for coding and reshaping transcripts, dataframes, and time spans for further use in analysis and visualization.
summary.cmspans
cm_range.temp
cm_df.transcript
cm_time.temp
cm_df.temp
cm_2long
cm_df2long
cm_range2long
cm_time2long
cm_code.blank
cm_code.combine
cm_code.exclude
cm_code.overlap
cm_code.transform
cm_distance
cm_dummy2long
cm_long2dummy
Functions for working between the tm and qdap packages.
as.tdm
apply_as_df
apply_as_tm
as.Corpus
as.Corpus.DocumentTermMatrix
as.Corpus.TermDocumentMatrix
as.Corpus.default
as.Corpus.sent_split
as.Corpus.wfm
as.DocumentTermMatrix
as.TermDocumentMatrix
as.data.frame.Corpus
as.dtm
as.dtm.Corpus
as.dtm.character
as.dtm.default
as.dtm.wfm
as.tdm.Corpus
as.tdm.character
as.tdm.default
as.tdm.wfm
Functions for word counts and descriptive statistics.
dist_tab
multiscale
object_pronoun_type
outlier_detect
outlier_labeler
pos
pos_by
pos_tags
pronoun_type
question_type
subject_pronoun_type
syllable_sum
combo_syllable_sum
polysyllable_sum
syllable_count
termco
termco_d
term_match
termco2mat
termco_c
wfm
wfdf
as.wfm
weight.wfdf
weight.wfm
wfm_combine
wfm_expanded
wfm.wfdf
wfm.character
wfm.factor
as.wfm.Corpus
as.wfm.DocumentTermMatrix
as.wfm.TermDocumentMatrix
as.wfm.data.frame
as.wfm.default
as.wfm.matrix
as.wfm.wfdf
wfm.Corpus
word_count
wc
character_count
character_table
char_table
word_stats
Word measures and scoring.
automated_readability_index
coleman_liau
flesch_kincaid
fry
linsear_write
SMOG
Dissimilarity
diversity
formality
kullback_leibler
polarity
word_cor
word_proximity
weight.word_proximity
Tools to assist in transcript/discourse analysis.
blank2NA
build_qdap_vignette
duplicates
qcv
replacer
Identify sentence elements/types.
end_inc
end_mark
end_mark_by
imperative
NAer
Plotting functions.
dispersion_plot
gradient_cloud
gantt_plot
gantt_wrap
phrase_net
plot.character_table
plot.cmspans
plot.diversity
plot.formality
plot.gantt
plot.kullback_leibler
plot.polarity
plot.pos_by
plot.question_type
plot.rmgantt
plot.sent_split
plot.sum_cmspans
plot.sums_gantt
plot.termco
plot.wfdf
plot.wfm
plot.word_stats
qheat
qheat.character_table
qheat.default
qheat.diversity
qheat.pos_by
qheat.question_type
qheat.termco
qheat.word_stats
rank_freq_mplot
rank_freq_plot
tot_plot
trans_cloud
trans_venn
word_network_plot
Network Plots for qdap Objects.
discourse_map
Network
Network.formality
Network.polarity
qtheme
theme_badkitchen
theme_cafe
theme_duskheat
theme_grayscale
theme_greyscale
theme_hipster
theme_nightheat
theme_norah
Network Plots for qdap Objects.
cumulative
cumulative.animated_formality
cumulative.animated_polarity
cumulative.combo_syllable_sum
cumulative.end_mark
cumulative.formality
cumulative.polarity
cumulative.pos
cumulative.pos_by
cumulative.syllable_freq
Animate qdap Objects.
Animate
Animate.discourse_map
Animate.formality
Animate.gantt
Animate.gantt_plot
Animate.polarity
vertex_apply
edge_apply
print.all_words
print.adjacency_matrix
print.boolean_qdap
print.character_table
print.cm_distance
print.colsplit2df
print.Dissimilarity
print.diversity
print.formality
print.kullback_leibler
print.ngrams
print.polarity
print.pos
print.pos_by
print.qdap_context
print.question_type
print.sent_split
print.sum_cmspans
print.sums_gantt
print.termco
print.wfm
print.word_associate
print.word_list
print.word_stats
Data sets included in qdap and used in examples.
DATA
DATA2
DATA.SPLIT
pres_debates2012
pres_debate_raw2012
mraja1
mraja1spl
raj.act.1
raj.act.2
raj.act.3
raj.act.4
raj.act.5
raj.demographics
raj
rajPOS
rajSPLIT
raw.time.span
sample.time.span