mdclust--exploratory microarray analysis by multidimensional clustering

M Dugas; S Merk; S Breit; P Dirschedl

doi:10.1093/bioinformatics/bth009

mdclust--exploratory microarray analysis by multidimensional clustering

Bioinformatics. 2004 Apr 12;20(6):931-6. doi: 10.1093/bioinformatics/bth009. Epub 2004 Jan 29.

Authors

M Dugas¹, S Merk, S Breit, P Dirschedl

Affiliation

¹ Department of Medical Informatics, Marchioninistr. 15, D-81377 Munich, Germany. [email protected]

PMID: 14751972
DOI: 10.1093/bioinformatics/bth009

Abstract

Motivation: Unsupervised clustering of microarray data may detect potentially important, but not obvious characteristics of samples, for instance subgroups of diagnoses with distinct gene profiles or systematic errors in experimentation.

Results: Multidimensional clustering (mdclust) is a method, which identifies sets of sample clusters and associated genes. It applies iteratively two-means clustering and score-based gene selection. For any phenotype variable best matching sets of clusters can be selected. This provides a method to identify gene-phenotype associations, suited even for settings with a large number of phenotype variables. An optional model based discriminant step may reduce further the number of selected genes.

Publication types

Comparative Study
Evaluation Study
Validation Study

MeSH terms

Cluster Analysis*
Computer Graphics
Gene Expression Profiling / methods*
Humans
Leukemia / genetics*
Oligonucleotide Array Sequence Analysis / methods*
Pattern Recognition, Automated
Reproducibility of Results
Sensitivity and Specificity
Sequence Alignment / methods*
Sequence Analysis, DNA / methods*
Software*