A Global Coexpression Network Approach for Connecting Genes to Specialized Metabolic Pathways in Plants

Plant Cell. 2017 May;29(5):944-959. doi: 10.1105/tpc.17.00009. Epub 2017 Apr 13.

Abstract

Plants produce diverse specialized metabolites (SMs), but the genes responsible for their production and regulation remain largely unknown, hindering efforts to tap plant pharmacopeia. Given that genes comprising SM pathways exhibit environmentally dependent coregulation, we hypothesized that genes within a SM pathway would form tight associations (modules) with each other in coexpression networks, facilitating their identification. To evaluate this hypothesis, we used 10 global coexpression data sets, each a meta-analysis of hundreds to thousands of experiments, across eight plant species to identify hundreds of coexpressed gene modules per data set. In support of our hypothesis, 15.3 to 52.6% of modules contained two or more known SM biosynthetic genes, and module genes were enriched in SM functions. Moreover, modules recovered many experimentally validated SM pathways, including all six known to form biosynthetic gene clusters (BGCs). In contrast, bioinformatically predicted BGCs (i.e., those lacking an associated metabolite) were no more coexpressed than the null distribution for neighboring genes. These results suggest that most predicted plant BGCs are not genuine SM pathways and argue that BGCs are not a hallmark of plant specialized metabolism. We submit that global gene coexpression is a rich, largely untapped resource for discovering the genetic basis and architecture of plant natural products.

MeSH terms

  • Computational Biology
  • Gene Expression Profiling
  • Gene Expression Regulation, Plant / genetics*
  • Gene Expression Regulation, Plant / physiology
  • Metabolic Networks and Pathways / genetics*
  • Metabolic Networks and Pathways / physiology*
  • Multigene Family / genetics