Predictive minimum description length principle approach to inferring gene regulatory networks

Adv Exp Med Biol. 2011:696:37-43. doi: 10.1007/978-1-4419-7046-6_4.

Abstract

Reverse engineering of gene regulatory networks using information theory models has received much attention due to its simplicity, low computational cost, and capability of inferring large networks. One of the major problems with information theory models is to determine the threshold that defines the regulatory relationships between genes. The minimum description length (MDL) principle has been implemented to overcome this problem. The description length of the MDL principle is the sum of model length and data encoding length. A user-specified fine tuning parameter is used as control mechanism between model and data encoding, but it is difficult to find the optimal parameter. In this work, we propose a new inference algorithm that incorporates mutual information (MI), conditional mutual information (CMI), and predictive minimum description length (PMDL) principle to infer gene regulatory networks from DNA microarray data. In this algorithm, the information theoretic quantities MI and CMI determine the regulatory relationships between genes and the PMDL principle method attempts to determine the best MI threshold without the need of a user-specified fine tuning parameter. The performance of the proposed algorithm is evaluated using both synthetic time series data sets and a biological time series data set (Saccharomyces cerevisiae). The results show that the proposed algorithm produced fewer false edges and significantly improved the precision when compared to existing MDL algorithm.

MeSH terms

  • Algorithms*
  • Computational Biology
  • DNA, Fungal / genetics
  • Databases, Nucleic Acid
  • Gene Regulatory Networks*
  • Genes, Fungal
  • Genetic Engineering
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data
  • Saccharomyces cerevisiae / genetics
  • Systems Biology

Substances

  • DNA, Fungal