contentanalysis: Scientific Content and Citation Analysis from PDF Documents
Provides comprehensive tools for extracting and analyzing scientific content from PDF documents, including citation extraction, reference matching, text analysis, and bibliometric indicators. Supports multi-column PDF layouts, 'CrossRef' API <https://www.crossref.org/documentation/retrieve-metadata/rest-api/> integration, and advanced citation parsing.
| Version: | 0.2.0 |
|---|---|
| Depends: | R (≥ 4.1.0) |
| Imports: | base64enc (≥ 0.1-3), dplyr (≥ 1.1.0), httr2 (≥ 0.2.0), igraph, jsonlite (≥ 2.0.0), magrittr (≥ 2.0.4), openalexR (≥ 2.0.2), pdftools (≥ 3.6.0), purrr (≥ 1.1.0), stringr (≥ 1.5.2), tibble (≥ 3.3.0), tidyr (≥ 1.3.0), tidytext (≥ 0.4.3), visNetwork (≥ 2.1.4) |
| Suggests: | knitr, plotly, RColorBrewer, rmarkdown, scales, stringdist, testthat (≥ 3.0.0), mockery |
| Published: | 2025-10-30 |
| DOI: | 10.32614/CRAN.package.contentanalysis |
| Author: | Massimo Aria |
| Maintainer: | Massimo Aria |
| BugReports: | https://github.com/massimoaria/contentanalysis/issues |
| License: | GPL (≥ 3) |
| URL: | https://github.com/massimoaria/contentanalysis |
| NeedsCompilation: | no |
| Materials: | README, NEWS |
| CRAN checks: | contentanalysis results |
Linking:
Please use the canonical form https://CRAN.R-project.org/package=contentanalysis to link to this page.