Natural Language Processing Identifies Underdocumentation of Symptoms in Patients on Hemodialysis - PubMed (original) (raw)
Natural Language Processing Identifies Underdocumentation of Symptoms in Patients on Hemodialysis
Yang Dai et al. Kidney360. 2025.
Abstract
Key Points:
- Natural language processing can be used to identify patient symptoms from the electronic health records with good performance when compared with manual chart review.
- Natural language processing–extracted patient symptom burden does not reflect patient burden due to under-recognition and underdocumentation by health care professionals.
Background: Patients on hemodialysis have a high burden of emotional and physical symptoms. These symptoms are often under-recognized. Natural language processing (NLP) can be used to identify patient symptoms from electronic health records (EHRs). However, whether symptom documentation matches patient-reported burden is unclear.
Methods: We conducted a prospective study of patients seen at an ambulatory nephrology practice from September 2020 to April 2021. We collected symptom surveys from patients, nurses, and physicians. We then developed an NLP algorithm to identify symptoms from the patients' EHRs and validated the performance of this algorithm using manual chart review and patient surveys as a reference standard. Using patient surveys as the reference standard, we compared symptom identification by (1) physicians, (2) nurses, (3) physicians or nurses, and (4) NLP.
Results: We enrolled 97 patients into our study, 63% were female, 49% were non-Hispanic Black, and 41% were Hispanic. The most common symptoms reported by patients were fatigue (61%), cramping (59%), dry skin (53%), muscle soreness (43%), and itching (41%). Physicians and nurses significantly under-recognized patients' symptoms (sensitivity 0.51 [95% confidence interval (CI), 0.40 to 0.61] and 0.63 [95% CI, 0.52 to 0.72], respectively). Nurses were better at identifying symptoms when patients reported more severe symptoms. There was no difference in results by patients' sex or ethnicity. NLP had a sensitivity of 0.92, specificity of 0.95, positive predictive value of 0.75, and negative predictive value of 0.99 with manual EHR review as the reference standard and a sensitivity of 0.58 (95% CI, 0.47 to 0.68), specificity of 0.73 (95% CI, 0.48 to 0.89), positive predictive value of 0.92 (95% CI, 0.82 to 0.97), and negative predictive value of 0.24 (95% CI, 0.14 to 0.38) compared with patient surveys.
Conclusions: Although patients on hemodialysis report high prevalence of symptoms, symptoms are under-recognized and underdocumented. NLP was accurate at identifying symptoms when they were documented. Larger studies in representative populations are needed to assess the generalizability of the results of the study.
Keywords: ESKD; artificial intelligence; biostatistics; dialysis; hemodialysis; patient self-assessment.
Conflict of interest statement
Disclosure forms, as provided by each author, are available with the online version of the article at http://links.lww.com/KN9/A849.
Figures
Graphical abstract
Figure 1
Number of patients with symptoms, as identified by patients, nurses, health care professionals, and NLP. *Indicates a P value < 0.01 and ** a P value < 0.001 by McNemar’s test for comparison of patient survey with physician survey, nurse survey, and NLP. EHR, electronic health record; NLP, natural language processing.
Figure 2
Overall test parameters of different evaluators. Sensitivity, specificity, PPV, and NPV of nurse, physician, nurse/physician, and NLP for identifying symptoms in patients (A) overall and (B) by individual symptom. NPV, negative predictive value; PPV, positive predictive value.
References
- United States Renal Data System. 2021 USRDS Annual Data Report: Epidemiology of Kidney Disease in the United States. National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases; 2021.
LinkOut - more resources
Full Text Sources
Miscellaneous