Mixture modelling of gene expression data from microarray experiments

Bioinformatics. 2002 Feb;18(2):275-86. doi: 10.1093/bioinformatics/18.2.275.

Abstract

Motivation: Hierarchical clustering is one of the major analytical tools for gene expression data from microarray experiments. A major problem in the interpretation of the output from these procedures is assessing the reliability of the clustering results. We address this issue by developing a mixture model-based approach for the analysis of microarray data. Within this framework, we present novel algorithms for clustering genes and samples. One of the byproducts of our method is a probabilistic measure for the number of true clusters in the data.

Results: The proposed methods are illustrated by application to microarray datasets from two cancer studies; one in which malignant melanoma is profiled (Bittner et al., Nature, 406, 536-540, 2000), and the other in which prostate cancer is profiled (Dhanasekaran et al., 2001, submitted).

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Computational Biology
  • Databases, Genetic
  • Gene Expression Profiling / statistics & numerical data*
  • Humans
  • Male
  • Melanoma / genetics
  • Models, Genetic*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Prostatic Neoplasms / genetics
  • Skin Neoplasms / genetics