Universal and confident phosphorylation site localization using phosphoRS

J Proteome Res. 2011 Dec 2;10(12):5354-62. doi: 10.1021/pr200611n. Epub 2011 Nov 10.

Abstract

An algorithm for the assignment of phosphorylation sites in peptides is described. The program uses tandem mass spectrometry data in conjunction with the respective peptide sequences to calculate site probabilities for all potential phosphorylation sites. Tandem mass spectra from synthetic phosphopeptides were used for optimization of the scoring parameters employing all commonly used fragmentation techniques. Calculation of probabilities was adapted to the different fragmentation methods and to the maximum mass deviation of the analysis. The software includes a novel approach to peak extraction, required for matching experimental data to the theoretical values of all isoforms, by defining individual peak depths for the different regions of the tandem mass spectrum. Mixtures of synthetic phosphopeptides were used to validate the program by calculation of its false localization rate versus site probability cutoff characteristic. Notably, the empirical obtained precision was higher than indicated by the applied probability cutoff. In addition, the performance of the algorithm was compared to existing approaches to site localization such as Ascore. In order to assess the practical applicability of the algorithm to large data sets, phosphopeptides from a biological sample were analyzed, localizing more than 3000 nonredundant phosphorylation sites. Finally, the results obtained for the different fragmentation methods and localization tools were compared and discussed.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms*
  • Binding Sites
  • Chemical Fractionation / methods
  • HeLa Cells
  • Humans
  • Phosphopeptides / chemical synthesis
  • Phosphopeptides / chemistry*
  • Phosphopeptides / isolation & purification
  • Phosphorylation
  • Protein Isoforms / chemistry
  • Proteomics / methods*
  • Proteomics / standards
  • Reproducibility of Results
  • Software*
  • Tandem Mass Spectrometry

Substances

  • Phosphopeptides
  • Protein Isoforms