vitals: Large Language Model Evaluation (original) (raw)

A port of 'Inspect', a widely adopted 'Python' framework for large language model evaluation. Specifically aimed at 'ellmer' users who want to measure the effectiveness of their large language model-based products, the package supports prompt engineering, tool usage, multi-turn dialog, and model graded evaluations.

Version:	0.1.0
Depends:	R (≥ 4.1)
Imports:	cli, dplyr, ellmer (≥ 0.2.1), glue, httpuv, jsonlite, purrr, R6, rlang, rstudioapi, S7, tibble, tidyr, withr
Suggests:	ggplot2, here, htmltools, knitr, ordinal, rmarkdown, testthat (≥ 3.0.0)
Published:	2025-06-24
DOI:	10.32614/CRAN.package.vitals
Author:	Simon Couch [aut, cre], Max Kuhn [ctb], Hadley Wickham [ctb], Mine Cetinkaya-Rundel [ctb], Posit Software, PBC [cph, fnd]
Maintainer:	Simon Couch <simon.couch at posit.co>
BugReports:	https://github.com/tidyverse/vitals/issues
License:	MIT + file
URL:	https://github.com/tidyverse/vitals, https://vitals.tidyverse.org
NeedsCompilation:	no
Materials:	README, NEWS
CRAN checks:	vitals results

Documentation:

Downloads:

Linking:

Please use the canonical formhttps://CRAN.R-project.org/package=vitalsto link to this page.