CAPICE: a computational method for Consequence-Agnostic Pathogenicity Interpretation of Clinical Exome variations

Genome Med. 2020 Aug 24;12(1):75. doi: 10.1186/s13073-020-00775-w.

Abstract

Exome sequencing is now mainstream in clinical practice. However, identification of pathogenic Mendelian variants remains time-consuming, in part, because the limited accuracy of current computational prediction methods requires manual classification by experts. Here we introduce CAPICE, a new machine-learning-based method for prioritizing pathogenic variants, including SNVs and short InDels. CAPICE outperforms the best general (CADD, GAVIN) and consequence-type-specific (REVEL, ClinPred) computational prediction methods, for both rare and ultra-rare variants. CAPICE is easily added to diagnostic pipelines as pre-computed score file or command-line software, or using online MOLGENIS web service with API. Download CAPICE for free and open-source (LGPLv3) at https://github.com/molgenis/capice .

Keywords: Allele frequency; Clinical genetics; Exome sequencing; Genome diagnostics; Machine learning; Molecular consequence; Variant pathogenicity prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods*
  • Exome*
  • Gene Frequency
  • Genetic Association Studies / methods
  • Genetic Variation*
  • Humans
  • INDEL Mutation
  • Machine Learning
  • Molecular Diagnostic Techniques
  • Molecular Sequence Annotation
  • Polymorphism, Single Nucleotide
  • ROC Curve
  • Reproducibility of Results
  • Software*