Inferring gene expression from ribosomal promoter sequences, a crowdsourcing approach

Genome Res. 2013 Nov;23(11):1928-37. doi: 10.1101/gr.157420.113. Epub 2013 Aug 15.

Abstract

The Gene Promoter Expression Prediction challenge consisted of predicting gene expression from promoter sequences in a previously unknown, experimentally generated data set. The challenge was presented to the community in the framework of the sixth Dialogue for Reverse Engineering Assessments and Methods (DREAM6), a community effort to evaluate the status of systems biology modeling methodologies. Nucleotide-specific promoter activity was obtained by measuring fluorescence from promoter sequences fused upstream of a gene for yellow fluorescent protein and inserted in the same genomic site of the yeast Saccharomyces cerevisiae. Twenty-one teams submitted results predicting the expression levels of 53 different promoters from yeast ribosomal protein genes. Analysis of participant predictions shows that accurate values were difficult to obtain for low-expressed promoters and for mutated promoters, although in the latter case only when the mutation induced a large change in promoter activity compared to the wild-type sequence. As in previous DREAM challenges, we found that aggregation of participant predictions provided robust results, but did not fare better than the three best algorithms. Finally, this study not only provides a benchmark for the assessment of methods predicting the activity of a specific set of promoters from their sequence, but also shows that the top-performing algorithm, which used machine-learning approaches, can be improved by the addition of biological features such as transcription factor binding sites.
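The idea of aggregating participant predictions can be illustrated with a minimal sketch. The abstract does not state how predictions were combined or scored, so everything here is an assumption made for illustration: the aggregate is a plain per-promoter mean across teams, the score is a Pearson correlation against the measured values, and the function names (`pearson`, `aggregate_mean`) and all numbers are hypothetical toy data, not challenge results.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def aggregate_mean(predictions):
    """Average the teams' predictions promoter by promoter.

    predictions: list of per-team lists, one value per promoter,
    with promoters in the same order for every team.
    """
    n_teams = len(predictions)
    return [sum(column) / n_teams for column in zip(*predictions)]


# Toy data (made up): measured activity for four promoters
# and predictions from three hypothetical teams.
measured = [1.0, 2.0, 3.0, 4.0]
team_predictions = [
    [1.2, 1.8, 3.5, 3.9],
    [0.5, 2.5, 2.5, 4.5],
    [2.0, 2.0, 3.0, 3.0],
]

# Assumed aggregation rule: simple mean across teams.
consensus = aggregate_mean(team_predictions)

# Score each team and the consensus with the assumed metric.
team_scores = [pearson(p, measured) for p in team_predictions]
consensus_score = pearson(consensus, measured)

print("team scores:", team_scores)
print("consensus score:", consensus_score)
```

Comparing `consensus_score` with the entries of `team_scores` is the kind of check that tells whether an aggregate is competitive with individual methods. Other aggregation rules, such as averaging ranks instead of raw values, would slot into `aggregate_mean` in the same way.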

MeSH terms

  • Algorithms
  • Binding Sites / genetics
  • Crowdsourcing*
  • Gene Expression Profiling
  • Gene Expression Regulation, Fungal
  • Gene Expression*
  • Gene Regulatory Networks
  • Genes, Fungal
  • Models, Genetic
  • Mutation
  • Promoter Regions, Genetic*
  • Regulatory Elements, Transcriptional
  • Ribosomal Proteins / genetics*
  • Ribosomes / genetics*
  • Ribosomes / metabolism
  • Saccharomyces cerevisiae / genetics*
  • Saccharomyces cerevisiae / metabolism
  • Saccharomyces cerevisiae Proteins / genetics
  • Saccharomyces cerevisiae Proteins / metabolism
  • Systems Biology

Substances

  • Ribosomal Proteins
  • Saccharomyces cerevisiae Proteins