Data-driven ontologies

Pac Symp Biocomput. 2009:15-26.

Abstract

Gene networks are important tools in studying gene-gene relationships and gene function. Understanding the relationships within these networks is an important challenge. Ontologies are a critical tool in helping deal with these data. The use of the Gene Ontology, for example, has become routine in methods for validation, discovery, etc. Here we present a novel algorithm that synthesizes an ontology by considering both extant annotation terms and also the connections between genes in gene networks. The process is efficient and produces easily inspectable ontologies. Because the relationships drawn between terms are heavily influenced by data, we call these "Data-Driven" Ontologies. We apply this algorithm to both discover new relationships between biological processes and as a tool to compare sets of genes across microrarray experiments. Supplemental data and source code are available at: http://www.ddont.org

MeSH terms

  • Algorithms*
  • Computational Biology / methods*
  • Data Interpretation, Statistical
  • Gene Regulatory Networks*
  • Genes, Fungal
  • Models, Genetic
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data
  • Saccharomyces cerevisiae / genetics