tesseract: Open Source OCR Engine (original) (raw)
Bindings to 'Tesseract': a powerful optical character recognition (OCR) engine that supports over 100 languages. The engine is highly configurable in order to tune the detection algorithms and obtain the best possible results.
| Version: | 5.2.3 |
|---|---|
| Imports: | Rcpp (≥ 0.12.12), pdftools (≥ 1.5), curl, rappdirs, digest |
| LinkingTo: | Rcpp |
| Suggests: | magick (≥ 1.7), spelling, knitr, tibble, rmarkdown |
| Published: | 2025-03-23 |
| DOI: | 10.32614/CRAN.package.tesseract |
| Author: | Jeroen Ooms |
| Maintainer: | Jeroen Ooms |
| BugReports: | https://github.com/ropensci/tesseract/issues |
| License: | Apache License 2.0 |
| URL: | https://docs.ropensci.org/tesseract/ https://ropensci.r-universe.dev/tesseract |
| NeedsCompilation: | yes |
| SystemRequirements: | Tesseract >= 3.03 (libtesseract-dev / tesseract-devel) and Leptonica (libleptonica-dev / leptonica-devel). On Debian you need to install the English training data separately (tesseract-ocr-eng) |
| Language: | en-US |
| Materials: | |
| In views: | NaturalLanguageProcessing |
| CRAN checks: | tesseract results |
Documentation:
| Reference manual: | tesseract.html , <tesseract.pdf> |
|---|---|
| Vignettes: | Using the Tesseract OCR engine in R (source, R code) |
Downloads:
| Package source: | tesseract_5.2.3.tar.gz |
|---|---|
| Windows binaries: | r-devel: tesseract_5.2.3.zip, r-release: tesseract_5.2.3.zip, r-oldrel: tesseract_5.2.3.zip |
| macOS binaries: | r-release (arm64): tesseract_5.2.3.tgz, r-oldrel (arm64): tesseract_5.2.3.tgz, r-release (x86_64): tesseract_5.2.3.tgz, r-oldrel (x86_64): tesseract_5.2.3.tgz |
| Old sources: | tesseract archive |
Reverse dependencies:
| Reverse suggests: | camtrapR, imagerExtra, inlpubs, LLMAgentR, magick, orderanalyzer, pdftools, poldis |
|---|
Linking:
Please use the canonical formhttps://CRAN.R-project.org/package=tesseractto link to this page.