Exploring Multilingual Large Language Models for Enhanced TNM Classification of Radiology Report in Lung Cancer Staging

Hidetoshi Matsuo; Mizuho Nishio; Takaaki Matsunaga; Koji Fujimoto; Takamichi Murakami

doi:10.3390/cancers16213621

Exploring Multilingual Large Language Models for Enhanced TNM Classification of Radiology Report in Lung Cancer Staging

Cancers (Basel). 2024 Oct 26;16(21):3621. doi: 10.3390/cancers16213621.

Authors

Hidetoshi Matsuo¹, Mizuho Nishio^{1

2}, Takaaki Matsunaga¹, Koji Fujimoto³, Takamichi Murakami¹

Affiliations

¹ Department of Radiology, Kobe University Graduate School of Medicine, Kobe 650-0017, Japan.
² Center for Advanced Medical Engineering Research & Development, Kobe University, Kobe 650-0047, Japan.
³ Advanced Imaging in Medical Magnetic Resonance, Kyoto University, Kyoto 606-8507, Japan.

Abstract

Background/objectives: This study aimed to investigate the accuracy of Tumor, Node, Metastasis (TNM) classification based on radiology reports using GPT3.5-turbo (GPT3.5) and the utility of multilingual large language models (LLMs) in both Japanese and English.

Methods: Utilizing GPT3.5, we developed a system to automatically generate TNM classifications from chest computed tomography reports for lung cancer and evaluate its performance. We statistically analyzed the impact of providing full or partial TNM definitions in both languages using a generalized linear mixed model.

Results: The highest accuracy was attained with full TNM definitions and radiology reports in English (M = 94%, N = 80%, T = 47%, and TNM combined = 36%). Providing definitions for each of the T, N, and M factors statistically improved their respective accuracies (T: odds ratio [OR] = 2.35, p < 0.001; N: OR = 1.94, p < 0.01; M: OR = 2.50, p < 0.001). Japanese reports exhibited decreased N and M accuracies (N accuracy: OR = 0.74 and M accuracy: OR = 0.21).

Conclusions: This study underscores the potential of multilingual LLMs for automatic TNM classification in radiology reports. Even without additional model training, performance improvements were evident with the provided TNM definitions, indicating LLMs' relevance in radiology contexts.

Keywords: TNM classification; lung cancer; multilingual large language models; radiology.

Abstract

Grants and funding