![]()
qdap (Quantitative Discourse Analysis Package) is an R package designed to assist in quantitative discourse analysis. The package stands as a bridge between qualitative transcripts of dialogue and statistical analysis and visualization.
Note: Windows users may need to install RCurl before installing qdap. Use the following short script:
URL <- "http://www.stats.ox.ac.uk/pub/RWin/bin/windows/contrib/3.0/"
install.packages("RCurl", contriburl = URL)
install.packages("qdap")
Additionally, if there is a discrepency between the R and Java architectures you will have to download the appropriate version of Java compatable with the version of R you're using.
Download the development version of qdap here
A function to generate a project template of folders, scripts and documents.
new_projectFunctions for importing data and exporting output.
dir_mapmcsv_r mcsv_wread.transcriptFunction to clean and parse text data.
bracketXbracketXtractgenXgenXtractbeg2charchar2endcleanincomplete.replaceincompmultigsub mgsubpotential_NAqprepreplace_abbreviationreplace_contractionreplace_numberreplace_symbolrm_rowrm_empty_rowscrubberspace_fillspastestemmerstem.wordsstem2dfTrimFunctions to aid data viewing.
htruncdfltruncdfqviewtruncdfleft.just right.juststrWrapFunctions to reshape data.
adjacency_matrix adjmatcolSplitcolsplit2dflcolsplit2dfganttgantt_repkey_mergepaste2propqcombinesentSplitsentCombineTOTspeakerSplitFunctions for working with dialogue at the word level.
all_wordsbag.o.wordsbreakerword.splitcommonexcludengramsstopwordsstripsynonyms synword_associateword_diff_listword_listcm functions are code matrix functions. These functions are used for coding and reshaping transcripts, dataframes, and time spans for further use in analysis and visualization.
cm_code.blankcm_code.combinecm_code.excludecm_code.overlapcm_code.transformcm_combine.dummycm_df.fillcm_df.tempcm_df.transcriptcm_df2longcm_distancecm_dummy2longcm_long2dummycm_range.tempcm_range2longcm_time.tempcm_time2longFunctions for word counts and descriptive statistics.
distTabmultiscaleoutlier.detectoutlier.labelerpospos.bypos.tagsquestion_typesyllable.sumcombo_syllable.sumpolysyllable.sumsyllable.counttermcoterm.matchtermco.dtermco2mattermco.cwfm wfdfwf.combinewfm.expandedword.count wccharacter.countcharacter.table char.tableword_statsWord measures and Scoring.
automated_readability_index coleman_liau flesch_kincaid fry linsear_write SMOGdissimilaritydiversityformalitykullback.leiblerpolarityTools to assist in transcript/discourse analysis.
blank2NAcapitalizerduplicateshashhms2seclookupqcvreplacerSearchsec2hmstext2colorurl_dlv.outerIdentify sentence elements/types.
end_incend_markimperativeNAerPlotting functions.
gradient_cloudgantt_plotgantt_wrapplot.character.tableplot.diversityplot.formalityplot.polarityplot.pos.byplot.question_typeplot.termcoplot.word_statsqheatrank_freq_mplotrank_freq_plottot_plottrans.cloudtrans.vennword.network.plotprint.adjacency_matrixprint.character.tableprint.cm_distanceprint.colsplit2dfprint.dissimilarityprint.diversityprint.formalityprint.kullback.leiblerprint.polarityprint.posprint.pos.byprint.question_typeprint.termcoprint.v.outerprint.word_associateprint.word_listprint.word_statsData sets included in qdap and used in examples.
abbreviationsaction.verbsadverbBuckleySaltonSWLcontractionsDATADATA2DICTIONARYemoticonenv.sylenv.synincrease.amplification.wordsinterjectionslabMTmraja1mraja1splnegation.wordsnegative.wordsOnixTxtRetToolkitSWL1positive.wordsprepositionraj.act.1raj.act.2raj.act.3raj.act.4raj.act.5raj.demographicsrajrajPOSrajSPLITSYNONYMTop100WordsTop200WordsTop25Words