Normalization of single-channel DNA array data by principal component analysis

Bioinformatics. 2004 Jul 22;20(11):1772-84. doi: 10.1093/bioinformatics/bth170. Epub 2004 Mar 22.

Abstract

Motivation: Detailed comparison and analysis of the output of DNA gene expression arrays from multiple samples require global normalization of the individual gene intensities measured in the different hybridizations. Normalization is needed to account for variations in array preparation and sample hybridization conditions.

Results: Here, we present a simple, robust and accurate procedure, based on principal component analysis, for the global normalization of datasets generated with single-channel DNA arrays. The procedure makes minimal assumptions about the data and performs well in cases where other standard procedures produced biased estimates. It is also insensitive to data transformation, filtering (thresholding) and pre-screening.
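
The abstract does not spell out the algorithm, so the following is only an illustrative sketch of one way a principal-component-based global normalization could work, not the published procedure. It assumes that arrays differ mainly by a multiplicative scale factor and that most genes are unchanged between samples, so that the first (uncentered) principal direction of the genes-by-arrays intensity matrix is roughly proportional to the per-array scale factors. The function names `pca_scale_factors` and `normalize` and the simulated data are hypothetical.

```python
import numpy as np

def pca_scale_factors(X):
    """Estimate one relative scale factor per array (column) of X.

    X is a genes x arrays matrix of positive intensities.
    Assumption (not from the abstract): the first right singular vector
    of the uncentered matrix tracks the per-array scaling.
    """
    X = np.asarray(X, dtype=float)
    # Uncentered PCA via SVD; rows of vt are principal directions in array space.
    _, _, vt = np.linalg.svd(X, full_matrices=False)
    v = vt[0]
    # SVD leaves the sign of each vector arbitrary; make the factors positive.
    v = v * np.sign(v.sum())
    # Rescale so the factors average to 1.
    return v / v.mean()

def normalize(X):
    """Divide each array by its estimated scale factor."""
    return np.asarray(X, dtype=float) / pca_scale_factors(X)

# Simulated example: 2000 genes on 4 arrays with known scale differences.
rng = np.random.default_rng(0)
genes = rng.lognormal(mean=6.0, sigma=0.5, size=2000)
true_scale = np.array([1.0, 1.5, 0.7, 2.0])
noise = rng.lognormal(mean=0.0, sigma=0.02, size=(2000, 4))
X = genes[:, None] * true_scale[None, :] * noise

est = pca_scale_factors(X)   # relative factors, mean 1
X_norm = normalize(X)        # arrays brought to a common scale
```

In this simulation the ratios `est / est[0]` can be compared with `true_scale / true_scale[0]` to check how well the scaling is recovered. Because the factors come from the dominant direction of the whole gene cloud rather than from array means or medians, a sketch like this is less tied to a particular summary statistic, though real data would need checks for the stated assumptions.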

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.
  • Validation Study

MeSH terms

  • Algorithms*
  • Computer Simulation
  • Epithelium / physiology
  • Female
  • Gene Expression Profiling / methods*
  • Gene Expression Profiling / standards
  • Gene Expression Regulation / physiology
  • Humans
  • Models, Genetic*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods*
  • Oligonucleotide Array Sequence Analysis / standards
  • Ovary / physiology
  • Principal Component Analysis*
  • Reproducibility of Results
  • Sensitivity and Specificity