rtika: R Interface to 'Apache Tika' (original) (raw)
Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.
Version:
2.7.0
Depends:
R (≥ 3.5.0)
Imports:
curl, sys (≥ 2.1), stats, utils, digest, backports
Suggests:
jsonlite, xml2, testthat, knitr, rmarkdown, covr, magrittr
Published:
2023-05-04
DOI:
Author:
Sasha Goodman [aut, cre], The Apache Software Foundation [aut, cph], Julia Silge [rev] (Reviewed the package for rOpenSci, see https://github.com/ropensci/software-review/issues/191/), David Gohel [rev] (Reviewed the package for rOpenSci, see https://github.com/ropensci/software-review/issues/191/)
Maintainer:
Sasha Goodman
BugReports:
https://github.com/ropensci/rtika/issues/
License:
Apache License 2.0 | file
URL:
https://docs.ropensci.org/rtika/,https://github.com/ropensci/rtika/
NeedsCompilation:
no
SystemRequirements:
Java (>=8)
Materials:
CRAN checks: