Distribution bias analysis of germline and somatic single-nucleotide variations that impact protein functional site and neighboring amino acids

Sci Rep. 2017 Feb 8:7:42169. doi: 10.1038/srep42169.

Abstract

Single nucleotide variations (SNVs) can result in loss or gain of protein functional sites. We analyzed the effects of SNVs on enzyme active sites, ligand binding sites, and various types of post translational modification (PTM) sites. We found that, for most types of protein functional sites, the SNV pattern differs between germline and somatic mutations as well as between synonymous and non-synonymous mutations. From a total of 51,138 protein functional site affecting SNVs (pfsSNVs), a pan-cancer analysis revealed 142 somatic pfsSNVs in five or more cancer types. By leveraging patient information for somatic pfsSNVs, we identified 17 loss of functional site SNVs and 60 gain of functional site SNVs which are significantly enriched in patients with specific cancer types. Of the key pfsSNVs identified in our analysis above, we highlight 132 key pfsSNVs within 17 genes that are found in well-established cancer associated gene lists. For illustrating how key pfsSNVs can be prioritized further, we provide a use case where we performed survival analysis showing that a loss of phosphorylation site pfsSNV at position 105 in MEF2A is significantly associated with decreased pancreatic cancer patient survival rate. These 132 pfsSNVs can be used in developing genetic testing pipelines.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acetylation
  • Catalytic Domain
  • Databases, Genetic
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic*
  • Gene Ontology
  • Germ-Line Mutation*
  • Glycosylation
  • Humans
  • Methylation
  • Molecular Sequence Annotation
  • Neoplasm Proteins / chemistry
  • Neoplasm Proteins / genetics*
  • Neoplasm Proteins / metabolism
  • Neoplasms / genetics*
  • Neoplasms / metabolism
  • Neoplasms / mortality
  • Neoplasms / pathology
  • Phosphorylation
  • Polymorphism, Single Nucleotide*
  • Protein Processing, Post-Translational*
  • Survival Analysis
  • Ubiquitination

Substances

  • Neoplasm Proteins