Early Diagnosis: End-to-End CNN-LSTM Models for Mass Spectrometry Data Classification

Anal Chem. 2023 Sep 12;95(36):13431-13437. doi: 10.1021/acs.analchem.3c00613. Epub 2023 Aug 25.

Abstract

Liquid chromatography-mass spectrometry (LC-MS) is a powerful method for cell profiling. The use of LC-MS technology is a tool of choice for cancer research since it provides molecular fingerprints of analyzed tissues. However, the ubiquitous presence of noise, the peaks shift between acquisitions, and the huge amount of information owing to the high dimensionality of the data make rapid and accurate cancer diagnosis a challenging task. Deep learning (DL) models are not only effective classifiers but are also well suited to jointly learn feature representation and classification tasks. This is particularly relevant when applied to raw LC-MS data and hence avoid the need for costly preprocessing and complicated feature selection. In this study, we propose a new end-to-end DL methodology that addresses all of the above challenges at once, while preserving the high potential of LC-MS data. Our DL model is designed to early discriminate between tumoral and normal tissues. It is a combination of a convolutional neural network (CNN) and a long short-term memory (LSTM) Network. The CNN network allows for significantly reducing the high dimensionality of the data while learning spatially relevant features. The LSTM network enables our model to capture temporal patterns. We show that our model outperforms not only benchmark models but also state-of-the-art models developed on the same data. Our framework is a promising strategy for improving early cancer detection during a diagnostic process.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Benchmarking*
  • Chromatography, Liquid
  • Early Detection of Cancer*
  • Mass Spectrometry
  • Neural Networks, Computer