Prediction of nucleosome positioning by the incorporation of frequencies and distributions of three different nucleotide segment lengths into a general pseudo k-tuple nucleotide composition

Bioinformatics. 2017 Jan 1;33(1):42-48. doi: 10.1093/bioinformatics/btw562. Epub 2016 Aug 25.

Abstract

Motivation: Nucleosome positioning plays important roles in many eukaryotic intranuclear processes, such as transcriptional regulation and chromatin structure formation. The investigations of nucleosome positioning rules provide a deeper understanding of these intracellular processes.

Results: Nucleosome positioning prediction was performed using a model consisting of three types of variables characterizing a DNA sequence-the number of five-nucleotide sequences, the number of three-nucleotide combinations in one period of a helix, and mono- and di-nucleotide distributions in DNA fragments. Using recently proposed stringent benchmark datasets with low biases for Saccharomyces cerevisiae, Homo sapiens, Caenorhabditis elegans and Drosophila melanogaster, the present model was shown to have a better prediction performance than the recently proposed predictors. This model was able to display the common and organism-dependent factors that affect nucleosome forming and inhibiting sequences as well. Therefore, the predictors developed here can accurately predict nucleosome positioning and help determine the key factors influencing this process.

Contact: [email protected] information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Animals
  • Base Sequence
  • Caenorhabditis elegans / genetics
  • Chromatin Assembly and Disassembly*
  • DNA / metabolism
  • Drosophila melanogaster / genetics
  • Humans
  • Models, Genetic*
  • Nucleosomes / metabolism*
  • Saccharomyces cerevisiae / genetics

Substances

  • Nucleosomes
  • DNA