strip(x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE, lower.case = TRUE)
char.keep: Symbols that strip should keep. The default is to strip every symbol except apostrophes and a double tilde "~~". The double tilde "~~" is included as a convenient way to keep word groups together in functions that split text apart on spaces. To remove double tildes "~~", set char.keep to NULL.
digit.remove: If TRUE, strips digits from the text.
apostrophe.remove: If TRUE, removes apostrophes from the output.
lower.case: If TRUE, forces all alphabetic characters to lower case.

Returns a vector of text that has been stripped of unwanted characters.
Strip text of unwanted characters.
## Not run:
DATA$state  # no strip applied
strip(DATA$state)
strip(DATA$state, apostrophe.remove = FALSE)
strip(DATA$state, char.keep = c("?", "."))
## End(Not run)
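The arguments described above can be illustrated with a small base-R sketch that runs without the package installed. The function `strip_sketch` is a hypothetical stand-in written for this example, not the package's implementation of strip, and it simplifies in two places: each character of char.keep is kept individually (so a single "~" survives as well as "~~"), and runs of spaces left behind by removed characters are squeezed to one space. The last lines show why "~~" is kept by default: a word group joined by "~~" stays in one piece when the text is split on spaces.

```r
# Hypothetical approximation of the documented behaviour; NOT the package code.
strip_sketch <- function(x, char.keep = "~~", digit.remove = TRUE,
                         apostrophe.remove = TRUE, lower.case = TRUE) {
  x <- as.character(x)
  if (lower.case) x <- tolower(x)

  # Simplification: every single character found in char.keep is kept.
  keep <- unlist(strsplit(as.character(char.keep), ""))
  if (!apostrophe.remove) keep <- c(keep, "'")

  strip_one <- function(s) {
    chars <- strsplit(s, "")[[1]]
    # Letters and spaces always survive; other symbols only if listed in keep.
    ok <- grepl("[[:alpha:]]", chars) | chars == " " | chars %in% keep
    if (!digit.remove) ok <- ok | grepl("[[:digit:]]", chars)
    out <- paste(chars[ok], collapse = "")
    # Squeeze the gaps left by removed characters and trim the ends.
    trimws(gsub(" +", " ", out))
  }
  vapply(x, strip_one, character(1), USE.NAMES = FALSE)
}

strip_sketch("It's 9 o'clock, Sam!")                             # "its oclock sam"
strip_sketch("It's 9 o'clock, Sam!", apostrophe.remove = FALSE)  # "it's o'clock sam"
strip_sketch("Really? Yes.", char.keep = c("?", "."))            # "really? yes."
strip_sketch("New~~York is big!", char.keep = NULL)              # "newyork is big"

# "~~" keeps a word group together when splitting on spaces.
words <- strsplit(strip_sketch("New~~York is big!"), " ")[[1]]
gsub("~~", " ", words, fixed = TRUE)                             # "new york" "is" "big"
```

Inputs containing NA are not handled in this sketch.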
See also: rm_stopwords