The mutation spectrum in genomic late replication domains shapes mammalian GC content

Nucleic Acids Res. 2016 May 19;44(9):4222-32. doi: 10.1093/nar/gkw268. Epub 2016 Apr 16.

Abstract

Genome sequence compositions and epigenetic organizations are correlated extensively across multiple length scales. Replication dynamics, in particular, is highly correlated with GC content. We combine genome-wide time of replication (ToR) data, topological domains maps and detailed functional epigenetic annotations to study the correlations between replication timing and GC content at multiple scales. We find that the decrease in genomic GC content at large scale late replicating regions can be explained by mutation bias favoring A/T nucleotide, without selection or biased gene conversion. Quantification of the free dNTP pool during the cell cycle is consistent with a mechanism involving replication-coupled mutation spectrum that favors AT nucleotides at late S-phase. We suggest that mammalian GC content composition is shaped by independent forces, globally modulating mutation bias and locally selecting on functional element. Deconvoluting these forces and analyzing them on their native scales is important for proper characterization of complex genomic correlations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Composition
  • Cell Line, Tumor
  • Chromatin / genetics
  • DNA Replication*
  • Evolution, Molecular
  • Genome, Human
  • Humans
  • Mutation

Substances

  • Chromatin