Enhanced Artificial Intelligence Strategies in Renal Oncology: Iterative Optimization and Comparative Analysis of GPT 3.5 Versus 4.0

Rui Liang; Anguo Zhao; Lei Peng; Xiaojian Xu; Jianye Zhong; Fan Wu; Fulin Yi; Shaohua Zhang; Song Wu; Jianquan Hou

doi:10.1245/s10434-024-15107-0

Enhanced Artificial Intelligence Strategies in Renal Oncology: Iterative Optimization and Comparative Analysis of GPT 3.5 Versus 4.0

Ann Surg Oncol. 2024 Jun;31(6):3887-3893. doi: 10.1245/s10434-024-15107-0. Epub 2024 Mar 12.

Authors

Rui Liang^#^{1

2

3}, Anguo Zhao^#^{2

4}, Lei Peng^#^{2

3

5

6}, Xiaojian Xu^#¹, Jianye Zhong², Fan Wu⁷, Fulin Yi⁶, Shaohua Zhang^{8

9}, Song Wu^{10

11

12}, Jianquan Hou^{13

14}

Affiliations

¹ Department of Urology, The First Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China.
² Department of Urology, South China Hospital, Medical School, Shenzhen University, Shenzhen, Guangdong, China.
³ Department of Urology, The Third Affiliated Hospital of Shenzhen University (Luohu Hospital Group), Shenzhen University, Shenzhen, Guangdong, China.
⁴ Department of Urology, Medical Center of Soochow University, Suzhou Dushu Lake Hospital, Dushu Lake Hospital Affiliated to Soochow University, Suzhou, Jiangsu, China.
⁵ Department of Urology, Lanzhou University Second Hospital, Lanzhou, Gansu, China.
⁶ North Sichuan Medical College (University), Nanchong, Sichuan, China.
⁷ Faculty of Intelligent Manufacturing and Control Engineering, Shanghai Polytechnic University, Shanghai, China.
⁸ Department of Urology, South China Hospital, Medical School, Shenzhen University, Shenzhen, Guangdong, China. [email protected].
⁹ Department of Urology, The Third Affiliated Hospital of Shenzhen University (Luohu Hospital Group), Shenzhen University, Shenzhen, Guangdong, China. [email protected].
¹⁰ Department of Urology, South China Hospital, Medical School, Shenzhen University, Shenzhen, Guangdong, China. [email protected].
¹¹ Department of Urology, The Third Affiliated Hospital of Shenzhen University (Luohu Hospital Group), Shenzhen University, Shenzhen, Guangdong, China. [email protected].
¹² Department of Urology, Lanzhou University Second Hospital, Lanzhou, Gansu, China. [email protected].
¹³ Department of Urology, The First Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China. [email protected].
¹⁴ Department of Urology, Medical Center of Soochow University, Suzhou Dushu Lake Hospital, Dushu Lake Hospital Affiliated to Soochow University, Suzhou, Jiangsu, China. [email protected].

^# Contributed equally.

PMID: 38472675
DOI: 10.1245/s10434-024-15107-0

Abstract

Background: The rise of artificial intelligence (AI) in medicine has revealed the potential of ChatGPT as a pivotal tool in medical diagnosis and treatment. This study assesses the efficacy of ChatGPT versions 3.5 and 4.0 in addressing renal cell carcinoma (RCC) clinical inquiries. Notably, fine-tuning and iterative optimization of the model corrected ChatGPT's limitations in this area.

Methods: In our study, 80 RCC-related clinical questions from urology experts were posed three times to both ChatGPT 3.5 and ChatGPT 4.0, seeking binary (yes/no) responses. We then statistically analyzed the answers. Finally, we fine-tuned the GPT-3.5 Turbo model using these questions, and assessed its training outcomes.

Results: We found that the average accuracy rates of answers provided by ChatGPT versions 3.5 and 4.0 were 67.08% and 77.50%, respectively. ChatGPT 4.0 outperformed ChatGPT 3.5, with a higher accuracy rate in responses (p < 0.05). By counting the number of correct responses to the 80 questions, we then found that although ChatGPT 4.0 performed better (p < 0.05), both versions were subject to instability in answering. Finally, by fine-tuning the GPT-3.5 Turbo model, we found that the correct rate of responses to these questions could be stabilized at 93.75%. Iterative optimization of the model can result in 100% response accuracy.

Conclusion: We compared ChatGPT versions 3.5 and 4.0 in addressing clinical RCC questions, identifying their limitations. By applying the GPT-3.5 Turbo fine-tuned model iterative training method, we enhanced AI strategies in renal oncology. This approach is set to enhance ChatGPT's database and clinical guidance capabilities, optimizing AI in this field.

Keywords: Artificial intelligence; Chat generative pre-trained transformer; Fine-tuned model; GPT-3.5 Turbo; Renal cell carcinoma.

Publication types

Comparative Study

MeSH terms

Artificial Intelligence*
Carcinoma, Renal Cell* / pathology
Humans
Kidney Neoplasms* / pathology
Prognosis

Abstract

Publication types

MeSH terms

Grants and funding