Inferential, robust non-negative matrix factorization analysis of microarray data

Paul Fogel; S Stanley Young; Douglas M Hawkins; Nathalie Ledirac

doi:10.1093/bioinformatics/btl550

Bioinformatics. 2007 Jan 1;23(1):44-9. doi: 10.1093/bioinformatics/btl550. Epub 2006 Nov 8.

Authors

Paul Fogel¹, S Stanley Young, Douglas M Hawkins, Nathalie Ledirac

Affiliation

¹ Consultant 4 rue Le Goff, F-75005, Paris, France.

PMID: 17092989

Abstract

Motivation: Modern methods such as microarrays, proteomics and metabolomics often produce datasets with many more predictor variables than observations. Research in these areas is often exploratory; even so, there is interest in statistical methods that accurately point to effects that are likely to replicate. Correlations among predictors are used to improve the statistical analysis. We exploit two ideas: non-negative matrix factorization methods that create ordered sets of predictors; and statistical testing within ordered sets, which is done sequentially, removing the need for correction for multiple testing within the set.
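
The second idea, sequential testing within an ordered set, can be illustrated with a generic fixed-sequence procedure. The sketch below is not the authors' algorithm, which the abstract does not spell out. It assumes that a factorization has already produced one loading per gene on a single component, that the ordering is fixed before any test is run, and that a p-value is available for each gene. The function names, loadings and p-values are all hypothetical.

```python
def order_by_loading(loadings):
    """Return predictor indices sorted from largest to smallest loading."""
    return sorted(range(len(loadings)), key=lambda i: loadings[i], reverse=True)


def fixed_sequence_test(p_values, order, alpha=0.05):
    """Test predictors in the given order, each at the full level alpha.

    Testing stops at the first p-value above alpha; predictors later in
    the order are not tested. Because the order is fixed in advance, no
    multiplicity correction is applied within the set.
    """
    rejected = []
    for i in order:
        if p_values[i] > alpha:
            break
        rejected.append(i)
    return rejected


# Hypothetical loadings of five genes on one component, and hypothetical
# per-gene p-values (illustrative numbers only, not from the paper).
loadings = [0.10, 0.90, 0.40, 0.75, 0.05]
p_values = [0.001, 0.004, 0.200, 0.030, 0.002]

order = order_by_loading(loadings)
rejected = fixed_sequence_test(p_values, order, alpha=0.05)
print("test order:", order)
print("rejected:", rejected)
```

In this toy input, genes 0 and 4 have small p-values but sit after the first non-significant gene in the order, so they are never tested. That is the trade-off of a sequential scheme: each test runs at the full level, but power depends on how well the ordering places true effects first.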

Results: Simulations and theory point to increased statistical power. Computational algorithms are described in detail. The analysis and biological interpretation of a real dataset are given. In addition to the increased power, the benefit of our method is that the organized gene lists are likely to lead to a better understanding of the biology.

Availability: A SAS JMP executable script is available from http://www.niss.org/irMF

MeSH terms

Algorithms*
Analysis of Variance
Computational Biology / methods*
Databases, Genetic
Gene Expression Regulation, Leukemic / genetics*
Humans
Leukemia, Myeloid / genetics
Models, Genetic
Multigene Family
Oligonucleotide Array Sequence Analysis / methods*
Pattern Recognition, Automated / methods
Precursor Cell Lymphoblastic Leukemia-Lymphoma / genetics
Sequence Analysis, DNA / methods*