Generative Models for Quantification of DNA Modifications

Methods Mol Biol. 2018:1807:37-50. doi: 10.1007/978-1-4939-8561-6_4.

Abstract

There are multiple chemical modifications of cytosine that are important to the regulation and ultimately the functional expression of the genome. To date no single experiment can capture these separate modifications, and integrative experimental designs are needed to fully characterize cytosine methylation and chemical modification. This chapter describes a generative probabilistic model, Lux, for integrative analysis of cytosine methylation and its oxidized variants. Lux simultaneously analyzes partially orthogonal bisulfite sequencing data sets to estimate proportions of different cytosine methylation modifications and estimate multiple cytosine modifications for a single sample by integrating across experimental designs composed of multiple parallel destructive genomic measurements. Lux also considers the variation in measurements introduced by different imperfect experimental steps; the experimental variation can be quantified by using appropriate spike-in controls, allowing Lux to deconvolve the measurements and recover accurately the underlying signal.

Keywords: 5-methylcytosine oxidation; BS-seq/oxBS-seq/TAB-seq/fCAB-seq/CAB-seq/redBS-seq/MAB-seq; Bayesian analysis; Bisulfite sequencing; DNA methylation; Hierarchical generative modeling.

MeSH terms

  • DNA / genetics*
  • DNA Methylation
  • Genome
  • Quality Control
  • Sequence Analysis, DNA / methods*
  • Sulfites / metabolism

Substances

  • Sulfites
  • DNA
  • hydrogen sulfite