doi:10.1006/csla.2001.0169>) or extracting them into new variables. For example, emoticons are often used in text but not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.">

textclean: Text Cleaning Tools (original) (raw)

Tools to clean and process text. Tools are geared at checking for substrings that are not optimal for analysis and replacing or removing them (normalizing) with more analysis friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) <doi:10.1006/csla.2001.0169>) or extracting them into new variables. For example, emoticons are often used in text but not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.

Version: 0.9.3
Depends: R (≥ 3.4.0)
Imports: data.table, english (≥ 1.0-2), glue (≥ 1.3.0), lexicon (≥ 1.0.0), mgsub (≥ 1.5.0), qdapRegex, stringi, textshape (≥ 1.0.1), utils
Suggests: testthat
Published: 2018-07-23
DOI: 10.32614/CRAN.package.textclean
Author: Tyler Rinker [aut, cre], ctwheels StackOverflow [ctb]
Maintainer: Tyler Rinker <tyler.rinker at gmail.com>
BugReports: https://github.com/trinker/textclean/issues
License: GPL-2
URL: https://github.com/trinker/textclean
NeedsCompilation: no
Citation: textclean citation info
Materials: README
CRAN checks: textclean results

Documentation:

Downloads:

Reverse dependencies:

Linking:

Please use the canonical formhttps://CRAN.R-project.org/package=textcleanto link to this page.