Quantification of uncertainty of peptide retention time predictions from a sequence-based model in LC-MS/MS proteomics experiments

Annu Int Conf IEEE Eng Med Biol Soc. 2007:2007:1221-4. doi: 10.1109/IEMBS.2007.4352517.

Abstract

In high-throughput mass spectrometry-based proteomics, it is necessary to employ separations to reduce sample complexity prior to mass spectrometric peptide identification. Interest has begun to focus on using information from separations to aid in peptide identification. One of the most common separations is reversed-phase liquid chromatography, in which peptides are separated on the basis of their chromatographic retention time. We apply a sequence-based model of peptide hydrophobicity to the problem of predicting peptide retention times, first fitting the model parameters using a large set of peptide identifications and then testing its predictions using a set of completely different peptide identifications. We demonstrate that not only does the model provide reasonably accurate predictions, it also provides a quantification of the uncertainty of its predictions. The model may therefore be used to provide checks on future tentative peptide identifications, even when the peptide species in question has never been observed before.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Chromatography, High Pressure Liquid / methods*
  • Mass Spectrometry / methods*
  • Molecular Sequence Data
  • Peptide Mapping / methods*
  • Peptides / chemistry*
  • Proteome / chemistry*
  • Proteomics / methods*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Sequence Analysis, Protein / methods

Substances

  • Peptides
  • Proteome