Using graphical models and genomic expression data to statistically validate models of genetic regulatory networks

Pac Symp Biocomput. 2001:422-33. doi: 10.1142/9789814447362_0042.

Abstract

We propose a model-driven approach for analyzing genomic expression data that permits genetic regulatory networks to be represented in a biologically interpretable computational form. Our models permit latent variables capturing unobserved factors, describe arbitrarily complex (more than pair-wise) relationships at varying levels of refinement, and can be scored rigorously against observational data. The models that we use are based on Bayesian networks and their extensions. As a demonstration of this approach, we utilize 52 genomes worth of Affymetrix GeneChip expression data to correctly differentiate between alternative hypotheses of the galactose regulatory network in S. cerevisiae. When we extend the graph semantics to permit annotated edges, we are able to score models describing relationships at a finer degree of specification.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.
  • Validation Study

MeSH terms

  • Bayes Theorem
  • Galactose / metabolism
  • Gene Expression Profiling / statistics & numerical data*
  • Gene Expression Regulation, Fungal
  • Genome, Fungal
  • Models, Genetic*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae / metabolism

Substances

  • Galactose