Identifying transposon insertions and their effects from RNA-sequencing data

Nucleic Acids Res. 2017 Jul 7;45(12):7064-7077. doi: 10.1093/nar/gkx461.

Abstract

Insertional mutagenesis using engineered transposons is a potent forward genetic screening technique used to identify cancer genes in mouse model systems. In the analysis of these screens, transposon insertion sites are typically identified by targeted DNA-sequencing and subsequently assigned to predicted target genes using heuristics. As such, these approaches provide no direct evidence that insertions actually affect their predicted targets or how transcripts of these genes are affected. To address this, we developed IM-Fusion, an approach that identifies insertion sites from gene-transposon fusions in standard single- and paired-end RNA-sequencing data. We demonstrate IM-Fusion on two separate transposon screens of 123 mammary tumors and 20 B-cell acute lymphoblastic leukemias, respectively. We show that IM-Fusion accurately identifies transposon insertions and their true target genes. Furthermore, by combining the identified insertion sites with expression quantification, we show that we can determine the effect of a transposon insertion on its target gene(s) and prioritize insertions that have a significant effect on expression. We expect that IM-Fusion will significantly enhance the accuracy of cancer gene discovery in forward genetic screens and provide initial insight into the biological effects of insertions on candidate cancer genes.

MeSH terms

  • Acute Disease
  • Animals
  • Breast Neoplasms / genetics*
  • Breast Neoplasms / metabolism
  • Chromosome Mapping / methods
  • DNA Transposable Elements*
  • Datasets as Topic
  • Disease Models, Animal
  • Female
  • High-Throughput Nucleotide Sequencing
  • High-Throughput Screening Assays
  • Humans
  • Leukemia, B-Cell / genetics*
  • Leukemia, B-Cell / metabolism
  • Mice
  • Mutagenesis, Insertional*
  • Neoplasm Proteins / genetics*
  • Neoplasm Proteins / metabolism
  • Software*

Substances

  • DNA Transposable Elements
  • Neoplasm Proteins