Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing

Nat Commun. 2016 Jun 24:7:11708. doi: 10.1038/ncomms11708.

Abstract

Zea mays is an important genetic model for elucidating transcriptional networks. Uncertainties about the complete structure of mRNA transcripts limit the progress of research in this system. Here, using single-molecule sequencing technology, we produce 111,151 transcripts from 6 tissues capturing ∼70% of the genes annotated in maize RefGen_v3 genome. A large proportion of transcripts (57%) represent novel, sometimes tissue-specific, isoforms of known genes and 3% correspond to novel gene loci. In other cases, the identified transcripts have improved existing gene models. Averaging across all six tissues, 90% of the splice junctions are supported by short reads from matched tissues. In addition, we identified a large number of novel long non-coding RNAs and fusion transcripts and found that DNA methylation plays an important role in generating various isoforms. Our results show that characterization of the maize B73 transcriptome is far from complete, and that maize gene expression is more complex than previously thought.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Plant
  • Plant Proteins / genetics
  • Plant Proteins / metabolism*
  • Polymerase Chain Reaction
  • Sequence Analysis, RNA / methods
  • Transcriptome / genetics*
  • Zea mays / genetics*

Substances

  • Plant Proteins