vitals: Large Language Model Evaluation (original) (raw)

A port of 'Inspect', a widely adopted 'Python' framework for large language model evaluation. Specifically aimed at 'ellmer' users who want to measure the effectiveness of their large language model-based products, the package supports prompt engineering, tool usage, multi-turn dialog, and model graded evaluations.

Version: 0.1.0
Depends: R (≥ 4.1)
Imports: cli, dplyr, ellmer (≥ 0.2.1), glue, httpuv, jsonlite, purrr, R6, rlang, rstudioapi, S7, tibble, tidyr, withr
Suggests: ggplot2, here, htmltools, knitr, ordinal, rmarkdown, testthat (≥ 3.0.0)
Published: 2025-06-24
DOI: 10.32614/CRAN.package.vitals
Author: Simon Couch ORCID iD [aut, cre], Max Kuhn [ctb], Hadley Wickham ORCID iD [ctb], Mine Cetinkaya-RundelORCID iD [ctb], Posit Software, PBC ROR ID [cph, fnd]
Maintainer: Simon Couch <simon.couch at posit.co>
BugReports: https://github.com/tidyverse/vitals/issues
License: MIT + file
URL: https://github.com/tidyverse/vitals, https://vitals.tidyverse.org
NeedsCompilation: no
Materials: README, NEWS
CRAN checks: vitals results

Documentation:

Downloads:

Linking:

Please use the canonical formhttps://CRAN.R-project.org/package=vitalsto link to this page.