Abstract
We describe an algorithm for finding the most statistically significant non-overlapping subtrees of a hierarchical clustering of gene expression data with respect to a set of secondary data labels on genes. The method is implemented as a Java plug-in for a commercial gene expression analysis program (GeneSpring).
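The abstract does not state which test statistic or selection rule the algorithm uses, so the following is only a minimal sketch under stated assumptions: each subtree is scored with a one-sided hypergeometric enrichment test against a single gene label, and non-overlapping subtrees are then picked greedily in order of increasing p-value. The nested-tuple tree encoding, the function names, and the `alpha` cutoff are all hypothetical, and no multiple-testing correction is applied.

```python
from math import comb

def leaves(tree):
    """Return gene names under a node; a leaf is a string, an internal node a tuple of children."""
    if isinstance(tree, str):
        return [tree]
    return [g for child in tree for g in leaves(child)]

def subtrees(tree):
    """Yield every internal node of the tree, including the root."""
    if isinstance(tree, str):
        return
    yield tree
    for child in tree:
        yield from subtrees(child)

def hypergeom_tail(N, K, n, k):
    """P(X >= k) when n genes are drawn from N genes, K of which carry the label."""
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(n, K) + 1))
    return hits / comb(N, n)

def significant_subtrees(tree, labeled, alpha=0.05):
    """Greedy selection (an assumption, not the published method) of disjoint enriched subtrees."""
    all_genes = set(leaves(tree))
    N, K = len(all_genes), len(labeled & all_genes)
    scored = []
    for node in subtrees(tree):
        genes = set(leaves(node))
        p = hypergeom_tail(N, K, len(genes), len(genes & labeled))
        if p <= alpha:
            scored.append((p, genes))
    scored.sort(key=lambda item: item[0])
    chosen, used = [], set()
    for p, genes in scored:
        # Two subtrees of one tree overlap exactly when they share a leaf.
        if genes.isdisjoint(used):
            chosen.append((p, sorted(genes)))
            used |= genes
    return chosen

# Toy dendrogram of eight genes; the label marks the four genes in the left half.
toy_tree = ((("a", "b"), ("c", "d")), (("e", "f"), ("g", "h")))
toy_result = significant_subtrees(toy_tree, {"a", "b", "c", "d"})
```

In this toy case only the left half of the dendrogram passes the cutoff. A subtree's ancestors and descendants always share leaves with it, so the disjointness check prevents reporting nested subtrees twice.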
MeSH terms
- Algorithms*
- Cluster Analysis
- Databases, Genetic
- Documentation / methods
- Gene Expression Profiling / methods*
- Information Storage and Retrieval / methods*
- Programming Languages
- Sequence Alignment / methods*
- Sequence Analysis, DNA / methods*
- Software*
- Systems Integration
- User-Computer Interface*