A machine learning approach based on ACMG/AMP guidelines for genomic variant classification and prioritization

Sci Rep. 2022 Feb 15;12(1):2517. doi: 10.1038/s41598-022-06547-3.

Abstract

Genomic variant interpretation is a critical step of the diagnostic procedure, often supported by the application of tools that may predict the damaging impact of each variant or provide a guidelines-based classification. We propose the application of Machine Learning methodologies, in particular Penalized Logistic Regression, to support variant classification and prioritization. Our approach combines ACMG/AMP guidelines for germline variant interpretation as well as variant annotation features and provides a probabilistic score of pathogenicity, thus supporting the prioritization and classification of variants that would be interpreted as uncertain by the ACMG/AMP guidelines. We compared different approaches in terms of variant prioritization and classification on different datasets, showing that our data-driven approach is able to solve more variant of uncertain significance (VUS) cases in comparison with guidelines-based approaches and in silico prediction tools.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Cohort Studies
  • Computer Simulation
  • Genetic Predisposition to Disease*
  • Genetic Testing / methods
  • Genetic Variation*
  • Genome, Human
  • Genomics / methods
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Logistic Models
  • Machine Learning*
  • Neoplasms / diagnosis
  • Neoplasms / genetics*
  • Practice Guidelines as Topic*
  • Research Design
  • Software