AgeTopicModels: Inferring Age-Dependent Disease Topic from Diagnosis Data (original) (raw)
We propose an age-dependent topic modelling (ATM) model, providing a low-rank representation of longitudinal records of hundreds of distinct diseases in large electronic health record data sets. The model assigns to each individual topic weights for several disease topics; each disease topic reflects a set of diseases that tend to co-occur as a function of age, quantified by age-dependent topic loadings for each disease. The model assumes that for each disease diagnosis, a topic is sampled based on the individual’s topic weights (which sum to 1 across topics, for a given individual), and a disease is sampled based on the individual’s age and the age-dependent topic loadings (which sum to 1 across diseases, for a given topic at a given age). The model generalises the Latent Dirichlet Allocation (LDA) model by allowing topic loadings for each topic to vary with age. References: Jiang (2023) <doi:10.1038/s41588-023-01522-8>.
| Version: | 0.1.0 |
|---|---|
| Depends: | R (≥ 3.5) |
| Imports: | dplyr, ggplot2, ggrepel, grDevices, gtools, magrittr, pROC, reshape2, rlang, stats, stringr, tibble, tidyr, utils |
| Suggests: | testthat (≥ 3.0.0) |
| Published: | 2025-10-21 |
| DOI: | 10.32614/CRAN.package.AgeTopicModels |
| Author: | Xilin Jiang |
| Maintainer: | Xilin Jiang |
| License: | MIT + file |
| NeedsCompilation: | no |
| Materials: | README, NEWS |
| CRAN checks: | AgeTopicModels results |
Documentation:
Downloads:
Linking:
Please use the canonical formhttps://CRAN.R-project.org/package=AgeTopicModelsto link to this page.