General statistical modeling of data from protein relative expression isobaric tags

J Proteome Res. 2011 Jun 3;10(6):2758-66. doi: 10.1021/pr1012784. Epub 2011 May 11.

Abstract

Quantitative comparison of the protein content of biological samples is a fundamental tool of research. The TMT and iTRAQ isobaric labeling technologies allow the comparison of 2, 4, 6, or 8 samples in one mass spectrometric analysis. Sound statistical models that scale with the most advanced mass spectrometry (MS) instruments are essential for their efficient use. Through the application of robust statistical methods, we developed models that capture variability from individual spectra to biological samples. Classical experimental designs with a distinct sample in each channel as well as the use of replicates in multiple channels are integrated into a single statistical framework. We have prepared complex test samples including controlled ratios ranging from 100:1 to 1:100 to characterize the performance of our method. We demonstrate its application to actual biological data sets originating from three different laboratories and MS platforms. Finally, test data and an R package, named isobar, which can read Mascot, Phenyx, and mzIdentML files, are made available. The isobar package can also be used as an independent software that requires very little or no R programming skills.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Blood Proteins / chemistry
  • Ceruloplasmin / chemistry
  • Data Interpretation, Statistical
  • Humans
  • Mice
  • Models, Statistical*
  • Proteomics / methods*
  • Rats
  • Software*

Substances

  • Blood Proteins
  • Ceruloplasmin