Protein identification using top-down

Mol Cell Proteomics. 2012 Jun;11(6):M111.008524. doi: 10.1074/mcp.M111.008524. Epub 2011 Oct 25.

Abstract

In the last two years, because of advances in protein separation and mass spectrometry, top-down mass spectrometry moved from analyzing single proteins to analyzing complex samples and identifying hundreds and even thousands of proteins. However, computational tools for database search of top-down spectra against protein databases are still in their infancy. We describe MS-Align+, a fast algorithm for top-down protein identification based on spectral alignment that enables searches for unexpected post-translational modifications. We also propose a method for evaluating statistical significance of top-down protein identifications and further benchmark various software tools on two top-down data sets from Saccharomyces cerevisiae and Salmonella typhimurium. We demonstrate that MS-Align+ significantly increases the number of identified spectra as compared with MASCOT and OMSSA on both data sets. Although MS-Align+ and ProSightPC have similar performance on the Salmonella typhimurium data set, MS-Align+ outperforms ProSightPC on the (more complex) Saccharomyces cerevisiae data set.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Bacterial Proteins / chemistry*
  • Bacterial Proteins / metabolism
  • Data Interpretation, Statistical
  • Molecular Sequence Annotation
  • Molecular Sequence Data
  • Molecular Weight
  • Peptide Mapping / methods*
  • Protein Processing, Post-Translational
  • Proteome / chemistry*
  • Proteome / metabolism
  • Saccharomyces cerevisiae
  • Saccharomyces cerevisiae Proteins / chemistry*
  • Saccharomyces cerevisiae Proteins / metabolism
  • Salmonella typhimurium
  • Software*
  • Tandem Mass Spectrometry

Substances

  • Bacterial Proteins
  • Proteome
  • Saccharomyces cerevisiae Proteins