Polymorphisms involving gain or loss of CpG sites are significantly enriched in trait-associated SNPs

Oncotarget. 2015 Nov 24;6(37):39995-40004. doi: 10.18632/oncotarget.5650.

Abstract

Some single nucleotide polymorphisms (SNPs) create or abolish CpG sites, the substrate of DNA modifications such as methylation and hydroxymethylation. In this study, we defined these polymorphisms, which lead to gain or loss of CpG sites, as CpG site related SNPs (cgSNPs). Because cgSNPs change the DNA sequence, they might affect DNA modifications such as methylation; however, their functional consequences are poorly understood. We observed that a considerable proportion (23.0%) of common variants in the human genome were cgSNPs. In The Cancer Genome Atlas (TCGA) data, mutations involving loss of CpG sites were associated with reduced levels of methylation (~20.2%). Using public databases of expression quantitative trait loci (eQTLs), SCAN and seeQTL, we found by logistic regression and simulation tests that cgSNPs were significantly enriched among eQTLs. Furthermore, using a catalog of published genome-wide association studies (GWAS) recorded by the National Human Genome Research Institute (NHGRI), we observed that cgSNPs were more likely to be trait-associated loci, especially for cancers. Our results indicate that cgSNP status might be a meaningful annotation either in SNP functional prediction or in screening for trait-associated SNPs.
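
The cgSNP definition can be illustrated with a minimal Python sketch. This is not the authors' pipeline. It assumes that each SNP is described by its forward-strand alleles and the two immediately flanking bases, and that gain and loss are measured relative to the reference allele (the study may have used a different convention, such as the ancestral allele). The function names `cpg_offsets` and `classify_cgsnp` and the extra label "both" are hypothetical, introduced only for illustration. Because the dinucleotide CG is its own reverse complement, checking the forward strand is sufficient.

```python
def cpg_offsets(left, allele, right):
    """Return the offsets (relative to the SNP) at which a CpG dinucleotide starts.

    Offset -1 means the CpG is formed by the left flank and the SNP base;
    offset 0 means it is formed by the SNP base and the right flank.
    """
    tri = (left + allele + right).upper()
    return {i - 1 for i in range(2) if tri[i:i + 2] == "CG"}


def classify_cgsnp(left, ref, alt, right):
    """Classify a SNP by its effect on overlapping CpG sites.

    Assumption: gain/loss is defined as alternate allele versus reference allele.
    Returns "gain", "loss", "both" (one site lost, another created), or "none".
    """
    ref_sites = cpg_offsets(left, ref, right)
    alt_sites = cpg_offsets(left, alt, right)
    gained = alt_sites - ref_sites
    lost = ref_sites - alt_sites
    if gained and lost:
        return "both"
    if gained:
        return "gain"
    if lost:
        return "loss"
    return "none"


# Illustrative inputs (made up, not taken from the study):
print(classify_cgsnp("A", "C", "T", "G"))  # A[C>T]G destroys a CpG -> loss
print(classify_cgsnp("C", "A", "G", "T"))  # C[A>G]T creates a CpG  -> gain
print(classify_cgsnp("A", "A", "T", "A"))  # no CpG in either allele -> none
```

Under this sketch, a SNP counts as a cgSNP whenever the result is anything other than "none". The "both" case (for example C[C>G]G, where one CpG is destroyed and an adjacent one is created) is a design choice of the sketch; the abstract does not say how such variants were handled.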

Keywords: CpG site; DNA methylation; cancer; epigenetic; single nucleotide polymorphism (SNP).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • CpG Islands / genetics*
  • DNA Methylation
  • Databases, Genetic
  • Genetic Association Studies / methods
  • Genetic Predisposition to Disease / genetics
  • Genome, Human / genetics*
  • Genome-Wide Association Study
  • Humans
  • Logistic Models
  • Models, Genetic
  • Neoplasms / genetics
  • Neoplasms / pathology
  • Phenotype
  • Polymorphism, Single Nucleotide*
  • Quantitative Trait Loci / genetics*