Hypomethylated domain-enriched DNA motifs prepattern the accessible nucleosome organization in teleosts

Epigenetics Chromatin. 2017 Sep 20;10(1):44. doi: 10.1186/s13072-017-0152-2.

Abstract

Background: Gene promoters in vertebrate genomes show distinct chromatin features such as stably positioned nucleosome array and DNA hypomethylation. The nucleosomes are known to have certain sequence preferences, and the prediction of nucleosome positioning from DNA sequence has been successful in some organisms such as yeast. However, at gene promoters where nucleosomes are much more stably positioned than in other regions, the sequence-based model has failed to work well, and sequence-independent mechanisms have been proposed.

Results: Using DNase I-seq in medaka embryos, we demonstrated that hypomethylated domains (HMDs) specifically possess accessible nucleosome organization with longer linkers, and we reassessed the DNA sequence preference for nucleosome positioning in these specific regions. Remarkably, we found with a supervised machine learning algorithm, k-mer SVM, that nucleosome positioning in HMDs is accurately predictable from DNA sequence alone. Specific short sequences (6-mers) that contribute to the prediction are specifically enriched in HMDs and distribute periodically with approximately 200-bp intervals which prepattern the position of accessible linkers. Surprisingly, the sequence preference of the nucleosome and linker in HMDs is opposite from that reported previously. Furthermore, the periodicity of specific motifs at hypomethylated promoters was conserved in zebrafish.

Conclusion: This study reveals strong link between nucleosome positioning and DNA sequence at vertebrate promoters, and we propose hypomethylated DNA-specific regulation of nucleosome positioning.

Keywords: DNA methylation; DNA sequence; Nucleosome positioning; Vertebrate.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chromatin Assembly and Disassembly
  • DNA Methylation*
  • Fish Proteins / genetics*
  • Fish Proteins / metabolism
  • Machine Learning
  • Nucleosomes / genetics*
  • Nucleosomes / metabolism
  • Nucleotide Motifs*
  • Oryzias
  • Promoter Regions, Genetic
  • Sequence Analysis, DNA / methods

Substances

  • Fish Proteins
  • Nucleosomes