A resource of variant effect predictions of single nucleotide variants in model organisms

Mol Syst Biol. 2018 Dec 20;14(12):e8430. doi: 10.15252/msb.20188430.

Abstract

The effect of single nucleotide variants (SNVs) in coding and noncoding regions is of great interest in genetics. Although many computational methods aim to elucidate the effects of SNVs on cellular mechanisms, it is not straightforward to comprehensively cover different molecular effects. To address this, we compiled and benchmarked sequence and structure-based variant effect predictors and we computed the impact of nearly all possible amino acid and nucleotide variants in the reference genomes of Homo sapiens, Saccharomyces cerevisiae and Escherichia coli Studied mechanisms include protein stability, interaction interfaces, post-translational modifications and transcription factor binding sites. We apply this resource to the study of natural and disease coding variants. We also show how variant effects can be aggregated to generate protein complex burden scores that uncover protein complex to phenotype associations based on a set of newly generated growth profiles of 93 sequenced S. cerevisiae strains in 43 conditions. This resource is available through mutfunc (www.mutfunc.com), a tool by which users can query precomputed predictions by providing amino acid or nucleotide-level variants.

Keywords: burden score; genetic variants; genotype‐to‐phenotype; model organisms; resource.

MeSH terms

  • Computational Biology / methods*
  • Escherichia coli / genetics
  • Genome, Bacterial / genetics
  • Genome, Fungal / genetics
  • Genome, Human / genetics
  • Genotype
  • Humans
  • Molecular Sequence Annotation
  • Polymorphism, Single Nucleotide / genetics*
  • Protein Stability
  • Saccharomyces cerevisiae / genetics
  • Software*