Current algorithmic solutions for peptide-based proteomics data generation and identification

Curr Opin Biotechnol. 2013 Feb;24(1):31-8. doi: 10.1016/j.copbio.2012.10.013. Epub 2012 Nov 8.

Abstract

Peptide-based proteomic data sets are ever increasing in size and complexity. These data sets provide computational challenges when attempting to quickly analyze spectra and obtain correct protein identifications. Database search and de novo algorithms must consider high-resolution MS/MS spectra and alternative fragmentation methods. Protein inference is a tricky problem when analyzing large data sets of degenerate peptide identifications. Combining multiple algorithms for improved peptide identification puts significant strain on computational systems when investigating large data sets. This review highlights some of the recent developments in peptide and protein identification algorithms for analyzing shotgun mass spectrometry data when encountering the aforementioned hurdles. Also explored are the roles that analytical pipelines, public spectral libraries, and cloud computing play in the evolution of peptide-based proteomics.

Publication types

  • Research Support, American Recovery and Reinvestment Act
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Algorithms*
  • Databases, Protein
  • Humans
  • Mass Spectrometry / methods*
  • Peptides / analysis*
  • Peptides / chemistry*
  • Proteins / analysis
  • Proteins / chemistry
  • Proteomics / methods*
  • Tandem Mass Spectrometry / methods

Substances

  • Peptides
  • Proteins