Current algorithmic solutions for peptide-based proteomics data generation and identification

Michael R Hoopmann; Robert L Moritz

doi:10.1016/j.copbio.2012.10.013

Current algorithmic solutions for peptide-based proteomics data generation and identification

Curr Opin Biotechnol. 2013 Feb;24(1):31-8. doi: 10.1016/j.copbio.2012.10.013. Epub 2012 Nov 8.

Authors

Michael R Hoopmann¹, Robert L Moritz

Affiliation

¹ Institute for Systems Biology, Seattle, WA 98109, USA.

Abstract

Peptide-based proteomic data sets are ever increasing in size and complexity. These data sets provide computational challenges when attempting to quickly analyze spectra and obtain correct protein identifications. Database search and de novo algorithms must consider high-resolution MS/MS spectra and alternative fragmentation methods. Protein inference is a tricky problem when analyzing large data sets of degenerate peptide identifications. Combining multiple algorithms for improved peptide identification puts significant strain on computational systems when investigating large data sets. This review highlights some of the recent developments in peptide and protein identification algorithms for analyzing shotgun mass spectrometry data when encountering the aforementioned hurdles. Also explored are the roles that analytical pipelines, public spectral libraries, and cloud computing play in the evolution of peptide-based proteomics.

Publication types

Research Support, American Recovery and Reinvestment Act
Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.
Review

MeSH terms

Algorithms*
Databases, Protein
Humans
Mass Spectrometry / methods*
Peptides / analysis*
Peptides / chemistry*
Proteins / analysis
Proteins / chemistry
Proteomics / methods*
Tandem Mass Spectrometry / methods

Substances

Peptides
Proteins

Abstract

Publication types

MeSH terms

Substances

Grants and funding