Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models

J Am Soc Mass Spectrom. 2023 Sep 6;34(9):1989-1997. doi: 10.1021/jasms.3c00145. Epub 2023 Aug 7.

Abstract

An original approach that adopts machine learning inference to predict protein structural information using hydrogen-deuterium exchange mass spectrometry (HDX-MS) is described. The method exploits an in-house optimization program that increases the resolution of HDX-MS data from peptides to amino acids. A system is trained using Gradient Tree Boosting as a type of machine learning ensemble technique to assign a protein secondary structure. Using limited training data we generate a discriminative model that uses optimized HDX-MS data to predict protein secondary structure with an accuracy of 75%. This research could form the basis for new methods exploiting artificial intelligence to model protein conformations by HDX-MS.

MeSH terms

  • Artificial Intelligence*
  • Deuterium Exchange Measurement / methods
  • Hydrogen Deuterium Exchange-Mass Spectrometry*
  • Mass Spectrometry / methods
  • Protein Conformation
  • Proteins / chemistry

Substances

  • Proteins