Characterizing chromatin landscape from aggregate and single-cell genomic assays using flexible duration modeling

Nat Commun. 2020 Feb 6;11(1):747. doi: 10.1038/s41467-020-14497-5.

Abstract

ATAC-seq has become a leading technology for probing the chromatin landscape of single and aggregated cells. Distilling functional regions from ATAC-seq presents diverse analysis challenges. Methods commonly used to analyze chromatin accessibility datasets are adapted from algorithms designed to process different experimental technologies, disregarding the statistical and biological differences intrinsic to the ATAC-seq technology. Here, we present a Bayesian statistical approach that uses latent space models to better model accessible regions, termed ChromA. ChromA annotates chromatin landscape by integrating information from replicates, producing a consensus de-noised annotation of chromatin accessibility. ChromA can analyze single cell ATAC-seq data, correcting many biases generated by the sparse sampling inherent in single cell technologies. We validate ChromA on multiple technologies and biological systems, including mouse and human immune cells, establishing ChromA as a top performing general platform for mapping the chromatin landscape in different cellular populations from diverse experimental designs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Bayes Theorem
  • Chromatin / genetics*
  • Chromatin Immunoprecipitation Sequencing
  • Gene Library
  • Genomics / methods*
  • Humans
  • Markov Chains
  • Mice
  • Models, Genetic*
  • Molecular Sequence Annotation
  • Single-Cell Analysis

Substances

  • Chromatin