textrecipes: Extra 'Recipes' for Text Processing (original) (raw)

Converting text to numerical features requires specifically created procedures, which are implemented as steps according to the 'recipes' package. These steps allows for tokenization, filtering, counting (tf and tfidf) and feature hashing.

Version: 1.0.6
Depends: R (≥ 3.6), recipes (≥ 1.0.7)
Imports: lifecycle, dplyr, generics (≥ 0.1.0), magrittr, Matrix, purrr, rlang, SnowballC, tibble, tokenizers, vctrs, glue
LinkingTo: cpp11
Suggests: covr, data.table, dials (≥ 1.2.0), hardhat, janitor, knitr, modeldata, rmarkdown, sentencepiece, spacyr, stopwords, stringi, testthat (≥ 3.0.0), text2vec, tokenizers.bpe, udpipe, wordpiece
Published: 2023-11-15
DOI: 10.32614/CRAN.package.textrecipes
Author: Emil Hvitfeldt ORCID iD [aut, cre], Michael W. Kearney [cph] (author of count_functions), Posit Software, PBC [cph, fnd]
Maintainer: Emil Hvitfeldt <emil.hvitfeldt at posit.co>
BugReports: https://github.com/tidymodels/textrecipes/issues
License: MIT + file
URL: https://github.com/tidymodels/textrecipes,https://textrecipes.tidymodels.org/
NeedsCompilation: yes
SystemRequirements: "GNU make"
Materials: README NEWS
CRAN checks: textrecipes results

Documentation:

Downloads:

Linking:

Please use the canonical formhttps://CRAN.R-project.org/package=textrecipesto link to this page.