Improving Prediction of Complications Post-Proton Therapy in Lung Cancer Using Large Language Models and Meta-Analysis

Pei-Ju Chao; Chu-Ho Chang; Jyun-Jie Wu; Yen-Hsien Liu; Junping Shiau; Hsin-Hung Shih; Guang-Zhi Lin; Shen-Hao Lee; Tsair-Fwu Lee

doi:10.1177/10732748241286749

Cancer Control. 2024 Jan-Dec:31:10732748241286749. doi: 10.1177/10732748241286749.

Authors

Pei-Ju Chao¹,², Chu-Ho Chang¹, Jyun-Jie Wu¹, Yen-Hsien Liu¹, Junping Shiau¹, Hsin-Hung Shih¹, Guang-Zhi Lin¹, Shen-Hao Lee¹,²,³, Tsair-Fwu Lee¹,⁴,⁵,⁶

Affiliations

¹ Medical Physics and Informatics Laboratory of Electronics Engineering, National Kaohsiung University of Science and Technology, Kaohsiung, Taiwan.
² Department of Radiation Oncology, Kaohsiung Chang Gung Memorial Hospital and Chang Gung University College of Medicine, Kaohsiung, Taiwan.
³ Department of Radiation Oncology, Linkou Chang Gung Memorial Hospital and Chang Gung University College of Medicine, Linkou, Taiwan.
⁴ Graduate Institute of Clinical Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan.
⁵ Department of Medical Imaging and Radiological Sciences, Kaohsiung Medical University, Kaohsiung, Taiwan.
⁶ School of Dentistry, College of Dental Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan.

Abstract

Purpose: This study aims to improve the efficiency of predicting complications in lung cancer patients receiving proton therapy by using large language models (LLMs) and meta-analytic techniques to assess literature quality.

Materials and methods: We integrated systematic reviews with LLM evaluations, sourcing studies from Web of Science, PubMed, and Scopus, managed via EndNote X20. Inclusion and exclusion criteria ensured literature relevance. Techniques included meta-analysis, heterogeneity assessment using Cochran's Q test and I² statistics, and subgroup analyses for different complications. Quality and bias risk were assessed using the PROBAST tool and further analyzed with models such as ChatGPT-4, Llama2-13b, and Llama3-8b. Evaluation metrics included AUC, accuracy, precision, recall, F1 score, and time efficiency (WPM).
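The abstract names Cochran's Q test and the I² statistic but does not give the formulas. The following is a minimal sketch of the standard definitions, assuming fixed-effect inverse-variance weighting; the function name `cochran_q_i2` and its inputs are hypothetical and are not taken from the study.

```python
def cochran_q_i2(effects, variances):
    """Return (pooled effect, Cochran's Q, I^2 in percent).

    Assumption: each study contributes one effect size and its variance,
    and studies are weighted by inverse variance (fixed-effect model).
    """
    if len(effects) != len(variances) or len(effects) < 2:
        raise ValueError("need at least two studies with matching variances")

    # Inverse-variance weights: more precise studies count more.
    weights = [1.0 / v for v in variances]
    total_weight = sum(weights)

    # Weighted mean effect across studies.
    pooled = sum(w * e for w, e in zip(weights, effects)) / total_weight

    # Cochran's Q: weighted squared deviations from the pooled effect.
    q = sum(w * (e - pooled) ** 2 for w, e in zip(weights, effects))

    # I^2: share of total variation attributed to heterogeneity,
    # truncated at zero when Q falls below its degrees of freedom.
    df = len(effects) - 1
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    return pooled, q, i2
```

Under these definitions, I² is reported as 0% whenever Q does not exceed its degrees of freedom (number of studies minus one).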

Results: The meta-analysis revealed an overall effect size of 0.78 for model predictions, with high heterogeneity (I² = 72.88%, P < 0.001). Subgroup analyses for radiation-induced esophagitis and pneumonitis yielded predictive effect sizes of 0.79 and 0.77, respectively, each with a heterogeneity index (I²) of 0%, indicating no significant differences among the models in predicting these specific complications. In the LLM-based literature assessment, ChatGPT-4 achieved the highest accuracy at 90%, significantly outperforming the Llama3 and Llama2 models, whose accuracies ranged from 44% to 62%. Additionally, LLM evaluations were conducted 3229 times faster than manual assessments, markedly enhancing both efficiency and accuracy. The risk assessment identified nine studies as high risk, three as low risk, and one as unknown, confirming the robustness of ChatGPT-4 across the evaluation metrics.
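The accuracy, precision, recall, and F1 score used to compare the LLMs can be computed from paired reference and model judgments. The sketch below assumes a binary labeling (for example, "high" versus "low" risk of bias) with the manual assessment as the reference; the function name, label strings, and choice of positive class are hypothetical and not specified in the abstract.

```python
def classification_metrics(reference, predicted, positive="high"):
    """Compute accuracy, precision, recall, and F1 for one positive class.

    Assumption: `reference` holds the manual (human) labels and
    `predicted` holds the LLM's labels for the same items, in order.
    """
    if len(reference) != len(predicted) or not reference:
        raise ValueError("label lists must be non-empty and equal in length")

    pairs = list(zip(reference, predicted))
    tp = sum(1 for r, p in pairs if r == positive and p == positive)
    fp = sum(1 for r, p in pairs if r != positive and p == positive)
    fn = sum(1 for r, p in pairs if r == positive and p != positive)
    tn = sum(1 for r, p in pairs if r != positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}
```

With more than two labels (such as an additional "unknown" category), the same function can be applied once per label and the results averaged, which is one common convention rather than the procedure confirmed by the source.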

Conclusion: This study demonstrated that integrating large language models with meta-analysis techniques can significantly increase the efficiency of literature evaluations and reduce the time required for assessments. It also confirmed that there are no significant differences among models in predicting post-proton therapy complications in lung cancer patients.

Keywords: ChatGPT; large language model; lung cancer; meta-analysis; prediction model risk of bias assessment tool; proton therapy.

Plain language summary

Using Advanced AI to Improve Predictions of Treatment Side Effects in Lung Cancer: This research uses cutting-edge artificial intelligence (AI) techniques, including large language models like ChatGPT-4, to better predict potential side effects in lung cancer patients undergoing proton therapy. By analyzing extensive scientific literature quickly and accurately, this approach has proven to enhance the evaluation process, making it faster and more reliable in foreseeing complications from treatments.

Publication types

Meta-Analysis

MeSH terms

Humans
Lung Neoplasms* / radiotherapy
Proton Therapy* / adverse effects
Proton Therapy* / methods