Considerations for Patient Privacy of Large Language Models in Health Care: Scoping Review - PubMed (original) (raw)
Considerations for Patient Privacy of Large Language Models in Health Care: Scoping Review
Xiaoying Zhong et al. J Med Internet Res. 2025.
Abstract
Background: The application of large language models (LLMs) in health care holds significant potential for enhancing patient care and advancing medical research. However, the protection of patient privacy remains a critical issue, especially when handling patient health information (PHI).
Objective: This scoping review aims to evaluate the adequacy of current approaches and identify areas in need of improvement to ensure robust patient privacy protection in the existing studies about PHI-LLMs within the health care domain.
Methods: A search of the literature published from January 1, 2022, to July 20, 2025, was performed on July 20, 2025, using 2 databases (PubMed and Embase). This scoping review focused on the following three research questions: (1) What studies on the development and application of LLMs using PHI currently exist within the health care domain? (2) What patient privacy considerations are addressed in existing PHI-LLMs research, and are these measures sufficient? (3) How can future research on the development and application of LLMs using PHI better protect patient privacy? Studies were included if they focused on the development and application of LLMs within health care using PHI, encompassing activities such as model construction, fine-tuning, optimization, testing, and performance comparison. Eligible literature comprised original research articles written in English. Conversely, studies were excluded if they used publicly available datasets, under the assumption that such data have been adequately deidentified. Additionally, non-English publications, reviews, abstracts, incomplete reports, and preprints were excluded from the review due to the lack of rigorous peer review.
Results: This study systematically identified 9823 studies on PHI-LLM and included 464 studies published between 2022 and 2025. Among the 464 studies, (1) a small number of studies neglected ethical review (n=45, 9.7%) and patient informed consent (n=148, 31.9%) during the research process, (2) more than a third of the studies (n=178, 38.4%) failed to report whether to implement effective measures to protect PHI, and (3) there was a significant lack of transparency and comprehensive detail in anonymization and deidentification methods.
Conclusions: We propose comprehensive recommendations across 3 phases-study design, implementation, and reporting-to strengthen patient privacy protection and transparency in PHI-LLM. This study emphasizes the urgent need for the development of stricter regulatory frameworks and the adoption of advanced privacy protection technologies to effectively safeguard PHI. It is anticipated that future applications of LLMs in the health care field will achieve a balance between innovation and robust patient privacy protection, thereby enhancing ethical standards and scientific credibility.
Keywords: health care; large language models; patient health information; patient privacy; scoping review.
©Xiaoying Zhong, Siyi Li, Zhao Chen, Long Ge, Dongdong Yu, Shijia Wang, Liangzhen You, Hongcai Shang. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 21.11.2025.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures
Figure 1
PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) study selection diagram.
Figure 2
Sankey diagram of disease categories and task types.
References
- Wang L, Wan Z, Ni C, Song Q, Li Y, Clayton E, Malin B, Yin Z. Applications and concerns of ChatGPT and other conversational large language models in health care: systematic review. J Med Internet Res. 2024;26:e22769. doi: 10.2196/22769. https://www.jmir.org/2024//e22769/ v26i1e22769 -DOI -PMC -PubMed
- Varghese J, Chapiro J. ChatGPT: the transformative influence of generative AI on science and healthcare. J Hepatol. 2024;80(6):977–980. doi: 10.1016/j.jhep.2023.07.028. https://linkinghub.elsevier.com/retrieve/pii/S0168-8278(23)05039-0 S0168-8278(23)05039-0 -DOI -PubMed
- Doneva SE, Qin S, Sick B, Ellendorff T, Goldman J, Schneider G, Ineichen BV. Large language models to process, analyze, and synthesize biomedical texts: a scoping review. Discov Artif Intell. 2024;4(1) doi: 10.1007/s44163-024-00197-2. -DOI
- Xu X, Chen Y, Miao J. Opportunities, challenges, and future directions of large language models, including ChatGPT in medical education: a systematic scoping review. J Educ Eval Health Prof. 2024;21:6. doi: 10.3352/jeehp.2024.21.6. https://europepmc.org/abstract/MED/38486402 jeehp.2024.21.6 -DOI -PMC -PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical