rtika: R Interface to 'Apache Tika'

Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.
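
The two output modes named in the description can be sketched as follows. This is a minimal, hedged example and not taken from this page: it assumes Java 8 or later is available, that the Tika jar can be downloaded, and the input file names are hypothetical placeholders.

```r
# Minimal sketch; assumes Java (>= 8) is installed and network access is available.
library(rtika)

# One-time step: download the Apache Tika jar that the package runs via Java.
install_tika()

# Hypothetical input paths; replace with real local files or URLs.
batch <- c("report.pdf", "notes.docx")

# Plain text: a character vector with one element per input document.
text <- tika_text(batch)

# Structured XHTML for the same documents, also one string per input.
xhtml <- tika_xml(batch)

# XHTML strings can be parsed with xml2 (listed under Suggests),
# skipping any inputs that could not be processed.
parsed <- lapply(xhtml[!is.na(xhtml)], xml2::read_html)
```

Inputs that cannot be processed come back as `NA`, so checking `is.na(text)` before further processing is a reasonable precaution.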

Version:

2.7.0

Depends:

R (≥ 3.5.0)

Imports:

curl, sys (≥ 2.1), stats, utils, digest, backports

Suggests:

jsonlite, xml2, testthat, knitr, rmarkdown, covr, magrittr

Published:

2023-05-04

DOI:

10.32614/CRAN.package.rtika

Author:

Sasha Goodman [aut, cre], The Apache Software Foundation [aut, cph], Julia Silge [rev] (Reviewed the package for rOpenSci, see https://github.com/ropensci/software-review/issues/191/), David Gohel [rev] (Reviewed the package for rOpenSci, see https://github.com/ropensci/software-review/issues/191/)

Maintainer:

Sasha Goodman

BugReports:

https://github.com/ropensci/rtika/issues/

License:

Apache License 2.0 | file

URL:

https://docs.ropensci.org/rtika/, https://github.com/ropensci/rtika/

NeedsCompilation:

no

SystemRequirements:

Java (>=8)

Materials:

README NEWS

CRAN checks:

rtika results