Singular value decomposition for genome-wide expression data processing and modeling

Proc Natl Acad Sci U S A. 2000 Aug 29;97(18):10101-6. doi: 10.1073/pnas.97.18.10101.

Abstract

We describe the use of singular value decomposition in transforming genome-wide expression data from genes x arrays space to reduced diagonalized "eigengenes" x "eigenarrays" space, where the eigengenes (or eigenarrays) are unique orthonormal superpositions of the genes (or arrays). Normalizing the data by filtering out the eigengenes (and eigenarrays) that are inferred to represent noise or experimental artifacts enables meaningful comparison of the expression of different genes across different arrays in different experiments. Sorting the data according to the eigengenes and eigenarrays gives a global picture of the dynamics of gene expression, in which individual genes and arrays appear to be classified into groups of similar regulation and function, or similar cellular state and biological phenotype, respectively. After normalization and sorting, the significant eigengenes and eigenarrays can be associated with observed genome-wide effects of regulators, or with measured samples, in which these regulators are overactive or underactive, respectively.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Cell Cycle
  • Genome*
  • Genome, Fungal
  • Mathematics
  • Models, Biological
  • Models, Genetic*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis*
  • Open Reading Frames
  • Saccharomyces cerevisiae / cytology
  • Saccharomyces cerevisiae / genetics