A deep learning framework for hepatocellular carcinoma diagnosis using MS1 data

Wei Xu; Liying Zhang; Xiaoliang Qian; Nannan Sun; Xiao Tu; Dengfeng Zhou; Xiaoping Zheng; Jia Chen; Zewen Xie; Tao He; Shugang Qu; Yinjia Wang; Keda Yang; Kunkai Su; Shan Feng; Bin Ju

doi:10.1038/s41598-024-77494-4

A deep learning framework for hepatocellular carcinoma diagnosis using MS1 data

Sci Rep. 2024 Nov 4;14(1):26705. doi: 10.1038/s41598-024-77494-4.

Authors

Wei Xu^#^{1

2}, Liying Zhang^#³, Xiaoliang Qian^#³, Nannan Sun^#³, Xiao Tu^{1

4}, Dengfeng Zhou³, Xiaoping Zheng⁵, Jia Chen^{6

7}, Zewen Xie³, Tao He³, Shugang Qu³, Yinjia Wang⁸, Keda Yang⁹, Kunkai Su¹⁰, Shan Feng^{11

12}, Bin Ju^{13

14}

Affiliations

¹ College of Basic Medical Science, Zhejiang Chinese Medical University, 548 Binwen Rd, Hangzhou, 310053, China.
² Key Laboratory of Chinese Medicine Rheumatology of Zhejiang Province, 548 Binwen Rd, Hangzhou, 310053, China.
³ SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China.
⁴ Key Laboratory of Zhejiang Province, Management of Kidney Disease, Hangzhou, 310000, China.
⁵ Pathology Department, Shulan (Hangzhou) Hospital, Hangzhou, China.
⁶ School of Life Sciences, Key Laboratory of Structural Biology of Zhejiang Province, Westlake University, Hangzhou, 310024, China.
⁷ The Biomedical Research Core Facility, Mass Spectrometry and Metabolomics Core Facility, Westlake University, Hangzhou, 310024, China.
⁸ The First People's Hospital of Kunming, Intensive Care Unit, Kunming, 650032, China. [email protected].
⁹ Key Laboratory of Artificial Organs and Computational Medicine in Zhejiang Province, Shulan International Medical College, Zhejiang Shuren University, Hangzhou, 310015, China. [email protected].
¹⁰ The First Affiliated Hospital, State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, Zhejiang University School of Medicine, 79 Qingchun Road, Hangzhou, 310013, China. [email protected].
¹¹ School of Life Sciences, Key Laboratory of Structural Biology of Zhejiang Province, Westlake University, Hangzhou, 310024, China. [email protected].
¹² The Biomedical Research Core Facility, Mass Spectrometry and Metabolomics Core Facility, Westlake University, Hangzhou, 310024, China. [email protected].
¹³ SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China. [email protected].
¹⁴ Innovative Institute of Basic Medical Sciences, Zhejiang University, Hangzhou, 310022, Zhejiang, China. [email protected].

^# Contributed equally.

Abstract

Clinical proteomics analysis is of great significance for analyzing pathological mechanisms and discovering disease-related biomarkers. Using computational methods to accurately predict disease types can effectively improve patient disease diagnosis and prognosis. However, how to eliminate the errors introduced by peptide precursor identification and protein identification for pathological diagnosis remains a major unresolved issue. Here, we develop a powerful end-to-end deep learning model, termed "MS1Former", that is able to classify hepatocellular carcinoma tumors and adjacent non-tumor (normal) tissues directly using raw MS1 spectra without peptide precursor identification. Our model provides accurate discrimination of subtle m/z differences in MS1 between tumor and adjacent non-tumor tissue, as well as more general performance predictions for data-dependent acquisition, data-independent acquisition, and full-scan data. Our model achieves the best performance on multiple external validation datasets. Additionally, we perform a detailed exploration of the model's interpretability. Prospectively, we expect that the advanced end-to-end framework will be more applicable to the classification of other tumors.

MeSH terms

Biomarkers, Tumor / analysis
Carcinoma, Hepatocellular* / diagnosis
Carcinoma, Hepatocellular* / pathology
Deep Learning*
Humans
Liver Neoplasms* / diagnosis
Liver Neoplasms* / pathology
Proteomics* / methods

Substances

Biomarkers, Tumor

Abstract

MeSH terms

Substances

Grants and funding