Predictive models of subcellular localization of long RNAs

RNA. 2019 May;25(5):557-572. doi: 10.1261/rna.068288.118. Epub 2019 Feb 11.

Abstract

Export to the cytoplasm is a key regulatory junction for both protein-coding mRNAs and long noncoding RNAs (lncRNAs), and cytoplasmic enrichment varies dramatically both within and between those groups. We used a new computational approach and RNA-seq data from human and mouse cells to quantify the genome-wide association between cytoplasmic/nuclear ratios of both gene groups and various factors, including expression levels, splicing efficiency, gene architecture, chromatin marks, and sequence elements. Splicing efficiency emerged as the main predictive factor, explaining up to a third of the variability in localization. Combination with other features allowed predictive models that could explain up to 45% of the variance for protein-coding genes and up to 34% for lncRNAs. Factors associated with localization were similar between lncRNAs and mRNAs with some important differences. Readily accessible features can thus be used to predict RNA localization.

Keywords: RNA localization; intron retention; long noncoding RNAs; nuclear export; post-transcriptional regulation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Biological Transport
  • Cell Line
  • Cell Nucleus / chemistry
  • Cell Nucleus / metabolism*
  • Chromatin / chemistry
  • Chromatin / metabolism
  • Cytoplasm / chemistry
  • Cytoplasm / metabolism*
  • Exons
  • Gene Ontology
  • Genome*
  • HeLa Cells
  • Hep G2 Cells
  • Humans
  • Introns
  • K562 Cells
  • Mice
  • Models, Genetic*
  • Molecular Sequence Annotation
  • RNA Splicing*
  • RNA, Long Noncoding / genetics*
  • RNA, Long Noncoding / metabolism
  • RNA, Messenger / genetics*
  • RNA, Messenger / metabolism

Substances

  • Chromatin
  • RNA, Long Noncoding
  • RNA, Messenger