Non-referenced genome assembly from epigenomic short-read data

Epigenetics. 2014 Oct;9(10):1329-38. doi: 10.4161/15592294.2014.969610.

Abstract

Current computational methods used to analyze changes in DNA methylation and chromatin modification rely on sequenced genomes. Here we describe a pipeline for the detection of these changes from short-read sequence data that does not require a reference genome. Open source software packages were used for sequence assembly, alignment, and measurement of differential enrichment. The method was evaluated by comparing results with reference-based results showing a strong correlation between chromatin modification and gene expression. We then used our de novo sequence assembly to build the DNA methylation profile for the non-referenced Psammomys obesus genome. The pipeline described uses open source software for fast annotation and visualization of unreferenced genomic regions from short-read data.

Keywords: ChIP-seq; ChIP-seq, immunoprecipitated chromatin sequencing; DMR, differentially methylated region; DNA methylation; High-throughput sequencing; MBD-seq; MBD-seq, methyl binding domain protein sequencing; MeDIP-seq; methylated DNA immunoprecipitation sequencing; Psammomys obesus; RNA-seq, RNA sequencing; de novo assembly; epigenomic integration.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology
  • DNA Methylation*
  • Drosophila melanogaster
  • Epigenomics / methods*
  • Gerbillinae / genetics
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Male
  • Mice
  • Sequence Alignment
  • Sequence Analysis, DNA / methods*
  • Software
  • Transcriptome