MMSplice: modular modeling improves the predictions of genetic variant effects on splicing

Genome Biol. 2019 Mar 1;20(1):48. doi: 10.1186/s13059-019-1653-z.

Abstract

Predicting the effects of genetic variants on splicing is highly relevant for human genetics. We describe the framework MMSplice (modular modeling of splicing) with which we built the winning model of the CAGI5 exon skipping prediction challenge. The MMSplice modules are neural networks scoring exon, intron, and splice sites, trained on distinct large-scale genomics datasets. These modules are combined to predict effects of variants on exon skipping, splice site choice, splicing efficiency, and pathogenicity, with matched or higher performance than state-of-the-art. Our models, available in the repository Kipoi, apply to variants including indels directly from VCF files.

Keywords: Deep learning; Modular modeling; Splicing; Variant effect; Variant pathogenicity.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Genetic Diseases, Inborn
  • Genetic Variation*
  • Models, Genetic*
  • Neural Networks, Computer*