Advanced methylome analysis after bisulfite deep sequencing: an example in Arabidopsis

PLoS One. 2012;7(7):e41528. doi: 10.1371/journal.pone.0041528. Epub 2012 Jul 20.

Abstract

Deep sequencing after bisulfite conversion (BS-Seq) is the method of choice to generate whole genome maps of cytosine methylation at single base-pair resolution. Its application to genomic DNA of Arabidopsis flower bud tissue resulted in the first complete methylome, determining a methylation rate of 6.7% in this tissue. BS-Seq reads were mapped onto an in silico converted reference genome, applying the so-called 3-letter genome method. Here, we present BiSS (Bisufite Sequencing Scorer), a new method applying Smith-Waterman alignment to map bisulfite-converted reads to a reference genome. In addition, we introduce a comprehensive adaptive error estimate that accounts for sequencing errors, erroneous bisulfite conversion and also wrongly mapped reads. The re-analysis of the Arabidopsis methylome data with BiSS mapped substantially more reads to the genome. As a result, it determines the methylation status of an extra 10% of cytosines and estimates the methylation rate to be 7.7%. We validated the results by individual traditional bisulfite sequencing for selected genomic regions. In addition to predicting the methylation status of each cytosine, BiSS also provides an estimate of the methylation degree at each genomic site. Thus, BiSS explores BS-Seq data more extensively and provides more information for downstream analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics*
  • Base Sequence
  • Cytosine / metabolism
  • DNA Methylation / genetics*
  • Genome, Plant / genetics
  • High-Throughput Nucleotide Sequencing / methods*
  • Reference Standards
  • Reproducibility of Results
  • Sequence Alignment
  • Sequence Analysis, DNA*
  • Sulfites / chemistry*

Substances

  • Sulfites
  • Cytosine
  • hydrogen sulfite