Expanded methyl-sensitive cut counting reveals hypomethylation as an epigenetic state that highlights functional sequences of the genome

Proc Natl Acad Sci U S A. 2011 Jun 7;108(23):9715-20. doi: 10.1073/pnas.1105713108. Epub 2011 May 20.

Abstract

Methyl-sensitive cut counting (MSCC) with the HpaII methylation-sensitive restriction enzyme is a cost-effective method to pinpoint unmethylated CpGs at single base-pair resolution. However, it addresses only CpGs in the context of the CCGG site, leaving out the other 15 of the 16 possible XCGX tetranucleotides in which CpGs occur. We expanded MSCC to include three additional enzymes, addressing a total of 5 of the 16 XCGX combinations. This allowed us to survey methylation at about one-third of all CpGs in a mammalian genome. Applied to mouse liver DNA, the method confirmed data reported with other methods showing hypomethylation to be concentrated at promoters and in CpG islands (CGIs), with gene bodies and intergenic regions being mostly methylated. By grouping unmethylated CpGs, characterized by high MSCC scores (7% false discovery rate), we found a large number of unmethylated regions in intergenic and intronic sequence that do not qualify as CGIs. These regions are highly enriched in functional DNA sequences (Open Regulatory Annotation database) as well as in noncoding yet highly conserved mammalian sequences that are thought to be important but whose function is as yet unknown. About 50% of MSCC-defined unmethylated regions do not overlap algorithm-defined CGIs and offer a novel search space in which new functionalities of DNA may be found in health and disease.
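
The coverage argument in the abstract (one recognition site out of 16 possible XCGX contexts, expanded to 5 of 16) can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The function names and the toy sequence are hypothetical, and only the CCGG site is taken from the abstract; the recognition sites of the three additional enzymes are not stated here, so they are left for the caller to supply.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"


def xcgx_contexts():
    """Return all 16 tetranucleotides of the form XCGX."""
    return [f"{left}CG{right}" for left, right in product(BASES, repeat=2)]


def count_cpg_contexts(seq):
    """Count the XCGX context of every CpG on the given strand.

    A CpG at the very start or end of the sequence has no complete
    4-base context and is skipped.
    """
    seq = seq.upper()
    counts = Counter()
    for i in range(1, len(seq) - 2):
        if seq[i:i + 2] == "CG":
            counts[seq[i - 1:i + 3]] += 1
    return counts


def covered_fraction(seq, sites):
    """Fraction of CpGs whose XCGX context is one of the given sites."""
    counts = count_cpg_contexts(seq)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    covered = sum(n for context, n in counts.items() if context in sites)
    return covered / total


if __name__ == "__main__":
    # Toy sequence (hypothetical), not real genomic data.
    toy = "ACCGGTTACGTAGCGCA"
    # Only CCGG (HpaII) is named in the abstract; add further sites
    # here once the other enzymes' recognition sequences are known.
    sites = {"CCGG"}
    print(len(xcgx_contexts()), "possible XCGX contexts")
    print("fraction of CpGs covered:", covered_fraction(toy, sites))
```

Run against a real genome, the same counting would show how much of the CpG complement a given enzyme set can address. The proportion depends on how often each context occurs, which is why 5 of 16 contexts can correspond to roughly one-third of CpGs rather than exactly 5/16.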

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Animals
  • Base Sequence
  • CpG Islands / genetics*
  • DNA / chemistry
  • DNA / genetics
  • DNA Methylation*
  • Epigenomics / methods*
  • Genome / genetics*
  • Male
  • Mice
  • Polymerase Chain Reaction
  • Promoter Regions, Genetic / genetics
  • Sequence Analysis, DNA

Substances

  • DNA