Autonomous International Classification of Diseases Coding Using Pretrained Language Models and Advanced Prompt Learning Techniques: Evaluation of an Automated Analysis System Using Medical Text

Yan Zhuang; Junyan Zhang; Xiuxing Li; Chao Liu; Yue Yu; Wei Dong; Kunlun He

doi:10.2196/63020

Autonomous International Classification of Diseases Coding Using Pretrained Language Models and Advanced Prompt Learning Techniques: Evaluation of an Automated Analysis System Using Medical Text

JMIR Med Inform. 2025 Jan 6:13:e63020. doi: 10.2196/63020.

Authors

Yan Zhuang^#¹, Junyan Zhang^#¹, Xiuxing Li², Chao Liu³, Yue Yu³, Wei Dong⁴, Kunlun He¹

Affiliations

¹ Medical Big Data Research Center, Chinese PLA General Hospital, Beijing, China.
² School of Computer Science & Technology, Beijing Institute of Technology, Beijing, China.
³ Digital Health China Technologies Co Ltd, Beijing, China.
⁴ Senior Department of Cardiology, The Sixth Medical Center of PLA General Hospital, Beijing, China.

^# Contributed equally.

PMID: 39761555
DOI: 10.2196/63020

Abstract

Background: Machine learning models can reduce the burden on doctors by converting medical records into International Classification of Diseases (ICD) codes in real time, thereby enhancing the efficiency of diagnosis and treatment. However, it faces challenges such as small datasets, diverse writing styles, unstructured records, and the need for semimanual preprocessing. Existing approaches, such as naive Bayes, Word2Vec, and convolutional neural networks, have limitations in handling missing values and understanding the context of medical texts, leading to a high error rate. We developed a fully automated pipeline based on the Key-bidirectional encoder representations from transformers (BERT) approach and large-scale medical records for continued pretraining, which effectively converts long free text into standard ICD codes. By adjusting parameter settings, such as mixed templates and soft verbalizers, the model can adapt flexibly to different requirements, enabling task-specific prompt learning.

Objective: This study aims to propose a prompt learning real-time framework based on pretrained language models that can automatically label long free-text data with ICD-10 codes for cardiovascular diseases without the need for semiautomatic preprocessing.

Methods: We integrated 4 components into our framework: a medical pretrained BERT, a keyword filtration BERT in a functional order, a fine-tuning phase, and task-specific prompt learning utilizing mixed templates and soft verbalizers. This framework was validated on a multicenter medical dataset for the automated ICD coding of 13 common cardiovascular diseases (584,969 records). Its performance was compared against robustly optimized BERT pretraining approach, extreme language network, and various BERT-based fine-tuning pipelines. Additionally, we evaluated the framework's performance under different prompt learning and fine-tuning settings. Furthermore, few-shot learning experiments were conducted to assess the feasibility and efficacy of our framework in scenarios involving small- to mid-sized datasets.

Results: Compared with traditional pretraining and fine-tuning pipelines, our approach achieved a higher micro-F1-score of 0.838 and a macro-area under the receiver operating characteristic curve (macro-AUC) of 0.958, which is 10% higher than other methods. Among different prompt learning setups, the combination of mixed templates and soft verbalizers yielded the best performance. Few-shot experiments showed that performance stabilized and the AUC peaked at 500 shots.

Conclusions: These findings underscore the effectiveness and superior performance of prompt learning and fine-tuning for subtasks within pretrained language models in medical practice. Our real-time ICD coding pipeline efficiently converts detailed medical free text into standardized labels, offering promising applications in clinical decision-making. It can assist doctors unfamiliar with the ICD coding system in organizing medical record information, thereby accelerating the medical process and enhancing the efficiency of diagnosis and treatment.

Keywords: BERT; ICD; International Classification of Diseases; bidirectional encoder representations from transformers; cardiovascular disease; few-shot learning; multicenter medical data; pretrained language models; prompt learning.

©Yan Zhuang, Junyan Zhang, Xiuxing Li, Chao Liu, Yue Yu, Wei Dong, Kunlun He. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 06.01.2025.

MeSH terms

Cardiovascular Diseases / diagnosis
Electronic Health Records
Humans
International Classification of Diseases*
Machine Learning*
Natural Language Processing*