vitals: Large Language Model Evaluation (original) (raw)
A port of 'Inspect', a widely adopted 'Python' framework for large language model evaluation. Specifically aimed at 'ellmer' users who want to measure the effectiveness of their large language model-based products, the package supports prompt engineering, tool usage, multi-turn dialog, and model graded evaluations.
| Version: | 0.1.0 |
|---|---|
| Depends: | R (≥ 4.1) |
| Imports: | cli, dplyr, ellmer (≥ 0.2.1), glue, httpuv, jsonlite, purrr, R6, rlang, rstudioapi, S7, tibble, tidyr, withr |
| Suggests: | ggplot2, here, htmltools, knitr, ordinal, rmarkdown, testthat (≥ 3.0.0) |
| Published: | 2025-06-24 |
| DOI: | 10.32614/CRAN.package.vitals |
| Author: | Simon Couch |
| Maintainer: | Simon Couch <simon.couch at posit.co> |
| BugReports: | https://github.com/tidyverse/vitals/issues |
| License: | MIT + file |
| URL: | https://github.com/tidyverse/vitals, https://vitals.tidyverse.org |
| NeedsCompilation: | no |
| Materials: | README, NEWS |
| CRAN checks: | vitals results |
Documentation:
Downloads:
Linking:
Please use the canonical formhttps://CRAN.R-project.org/package=vitalsto link to this page.