A statistical selection strategy for normalization procedures in LC-MS proteomics experiments through dataset-dependent ranking of normalization scaling factors

Bobbie-Jo M Webb-Robertson; Melissa M Matzke; Jon M Jacobs; Joel G Pounds; Katrina M Waters

doi:10.1002/pmic.201100078

Proteomics. 2011 Dec;11(24):4736-41. doi: 10.1002/pmic.201100078. Epub 2011 Nov 17.

Authors

Bobbie-Jo M Webb-Robertson¹, Melissa M Matzke, Jon M Jacobs, Joel G Pounds, Katrina M Waters

Affiliation

¹ Pacific Northwest National Laboratory, USA.

Abstract

In a typical comparative proteomics experiment, the LC-MS peak intensities quantified during peptide identification vary from run to run of the instrument because of both technical and biological variation. Normalization of peak intensities across an LC-MS proteomics dataset is therefore a fundamental pre-processing step. However, the choice of normalization method can dramatically affect the downstream analysis of LC-MS proteomics data. Current normalization procedures for LC-MS proteomics data are presented in the context of normalization values derived from subsets of the full collection of identified peptides. The distribution of these normalization values is unknown a priori. If the values are not independent of the biological factors associated with the experiment, the normalization process can introduce bias into the data, possibly affecting downstream statistical biomarker discovery. We present a novel approach to evaluating normalization strategies that includes the peptide selection component associated with the derivation of normalization values. Our approach evaluates the effect of normalization on the between-group variance structure in order to identify the most appropriate normalization methods: those that improve the structure of the data without introducing bias into the normalized peak intensities.
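The independence concern raised in the abstract can be illustrated with a minimal sketch. It assumes log2 peak intensities stored as a peptide-by-run table, a per-run median as the normalization value, and a permutation test on a one-way ANOVA F statistic as the check for dependence on the biological group. The abstract does not name the statistic the authors use, so the test, the toy data, and the function names (`median_factors`, `f_statistic`, `permutation_p_value`) are all illustrative assumptions rather than the published procedure.

```python
import random
import statistics

def median_factors(log_intensity, peptide_subset):
    """Per-run normalization value: median log2 intensity over the chosen peptides.

    log_intensity[p][r] is the log2 intensity of peptide p in run r (None if missing).
    """
    n_runs = len(log_intensity[0])
    factors = []
    for run in range(n_runs):
        observed = [log_intensity[p][run] for p in peptide_subset
                    if log_intensity[p][run] is not None]
        factors.append(statistics.median(observed))
    return factors

def f_statistic(values, groups):
    """One-way ANOVA F statistic of `values` across the labels in `groups`."""
    labels = sorted(set(groups))
    grand_mean = statistics.fmean(values)
    by_group = {g: [v for v, lab in zip(values, groups) if lab == g] for g in labels}
    ss_between = sum(len(vs) * (statistics.fmean(vs) - grand_mean) ** 2
                     for vs in by_group.values())
    ss_within = sum((v - statistics.fmean(vs)) ** 2
                    for vs in by_group.values() for v in vs)
    if ss_within == 0:
        return float("inf") if ss_between > 0 else 0.0
    df_between = len(labels) - 1
    df_within = len(values) - len(labels)
    return (ss_between / df_between) / (ss_within / df_within)

def permutation_p_value(values, groups, n_perm=2000, seed=0):
    """Share of label shuffles whose F statistic is at least the observed one."""
    rng = random.Random(seed)
    observed = f_statistic(values, groups)
    shuffled = list(groups)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if f_statistic(values, shuffled) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Toy data (invented for illustration): 3 peptides x 8 runs, 4 runs per group.
groups = ["control"] * 4 + ["treated"] * 4
log_intensity = [
    [20.1, 20.3, 19.9, 20.2, 21.4, 21.6, 21.3, 21.5],
    [18.0, 18.2, 17.9, 18.1, 19.2, 19.5, 19.1, 19.4],
    [22.5, 22.4, None, 22.6, 23.8, 23.9, 23.6, 23.7],
]

factors = median_factors(log_intensity, peptide_subset=[0, 1, 2])
p_value = permutation_p_value(factors, groups)
print("normalization values:", factors)
print("permutation p-value for group dependence:", p_value)
```

Under these assumptions, a small p-value suggests that the normalization values track the biological groups, so subtracting them from the data risks removing or distorting real group differences. A different peptide subset or scaling rule can be screened the same way by swapping out `median_factors`.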

Publication types

Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Biometry / methods*
Chromatography, Liquid / methods
Data Interpretation, Statistical
Mass Spectrometry / methods
Peptides
Proteins / analysis
Proteomics / instrumentation
Proteomics / methods*

Substances

Peptides
Proteins
