Ensemble learning of genetic networks from time-series expression data

Bioinformatics. 2007 Dec 1;23(23):3225-31. doi: 10.1093/bioinformatics/btm514. Epub 2007 Oct 31.

Abstract

Motivation: Inferring genetic networks from time-series expression data has attracted a great deal of interest. In most cases, however, the number of genes exceeds the number of data points, which, in principle, makes it impossible to recover the underlying networks. To address this dimensionality problem, we apply the subset selection method to a linear system of difference equations. Previous approaches assign the single most likely combination of regulators to each target gene, which often causes over-fitting to the small number of data points.
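
The abstract does not write out the model. As an illustration only, one common form of a linear system of difference equations with subset selection is sketched below. The symbols x_i (expression level of gene i), w_ij (regulatory weight), S_i (regulator subset of target gene i), k (subset size), T (number of time points) and the least-squares criterion are assumptions, not taken from the source.

```latex
\begin{align*}
x_i(t+1) &= \sum_{j \in S_i} w_{ij}\, x_j(t) + \varepsilon_i(t), \qquad |S_i| = k, \\
\hat{S}_i &= \arg\min_{|S_i| = k} \; \min_{w_{i\cdot}} \sum_{t=1}^{T-1}
  \Bigl( x_i(t+1) - \sum_{j \in S_i} w_{ij}\, x_j(t) \Bigr)^{2}.
\end{align*}
```

Under these assumptions, each fit estimates only k weights from T-1 observed transitions, so choosing k well below the number of genes keeps every individual regression determined even when genes outnumber data points.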

Results: Here, we propose a new algorithm, named LEARNe, which merges the predictions from all the combinations of regulators that have a certain level of likelihood. LEARNe provides more accurate and robust predictions than previous methods for the structure of genetic networks under the linear system model. We tested LEARNe for reconstructing the SOS regulatory network of Escherichia coli and the cell cycle regulatory network of yeast from real experimental data, where LEARNe also performed better than previous methods.
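
The contrast between keeping a single best regulator combination and merging all sufficiently likely combinations can be illustrated with a minimal Python sketch. This is not the authors' LEARNe implementation: the abstract does not specify the likelihood or the merging rule, so the sketch assumes a residual sum of squares (RSS) as the fit criterion, an absolute RSS margin as the "level of likelihood", and an unweighted vote as the merge. The function names and the parameters subset_size and margin are hypothetical.

```python
from itertools import combinations


def solve_least_squares(rows, targets):
    """Least-squares weights via the normal equations and Gauss-Jordan elimination."""
    p = len(rows[0])
    # Augmented normal-equation matrix [X^T X | X^T y].
    a = [[sum(r[i] * r[j] for r in rows) for j in range(p)]
         + [sum(r[i] * y for r, y in zip(rows, targets))] for i in range(p)]
    for col in range(p):
        pivot = max(range(col, p), key=lambda k: abs(a[k][col]))
        if abs(a[pivot][col]) < 1e-12:
            return None  # singular: these regulators are collinear in the data
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(p):
            if r != col:
                f = a[r][col] / a[col][col]
                a[r] = [x - f * y for x, y in zip(a[r], a[col])]
    return [a[i][p] / a[i][i] for i in range(p)]


def residual_sum_of_squares(series, target, regulators):
    """RSS of predicting the target gene at t+1 from the regulators at t."""
    steps = range(len(series) - 1)
    rows = [[series[t][j] for j in regulators] for t in steps]
    ys = [series[t + 1][target] for t in steps]
    w = solve_least_squares(rows, ys)
    if w is None:
        return float("inf")
    return sum((y - sum(wi * xi for wi, xi in zip(w, row))) ** 2
               for row, y in zip(rows, ys))


def ensemble_regulator_scores(series, target, subset_size=2, margin=1e-6):
    """Return (single best subset, per-gene vote over all near-best subsets).

    series[t][g] is the expression of gene g at time point t.
    Assumption: a subset is 'likely enough' if its RSS is within `margin`
    of the best RSS; each retained subset casts one unweighted vote.
    """
    genes = range(len(series[0]))
    rss = {s: residual_sum_of_squares(series, target, s)
           for s in combinations(genes, subset_size)}
    best = min(rss, key=rss.get)
    retained = [s for s, value in rss.items() if value <= rss[best] + margin]
    scores = {g: sum(g in s for s in retained) / len(retained) for g in genes}
    return best, scores
```

Calling ensemble_regulator_scores(series, target=i) returns the single best subset, which corresponds to what the abstract attributes to previous approaches, together with a score per candidate regulator that reflects how often it appears among the retained subsets. A larger margin retains more subsets and spreads the vote more widely; a margin near zero reduces the ensemble to the single best subset.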

Availability: The MATLAB code is available upon request from the authors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computer Simulation
  • Gene Expression / physiology*
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation / physiology*
  • Models, Biological*
  • Proteome / metabolism*
  • Signal Transduction / physiology*
  • Time Factors

Substances

  • Proteome