Leveraging three-dimensional chromatin architecture for effective reconstruction of enhancer-target gene regulatory interactions

Nucleic Acids Res. 2021 Sep 27;49(17):e97. doi: 10.1093/nar/gkab547.

Abstract

Growing evidence in the literature suggests that germline sequence variants and somatic mutations in non-coding distal regulatory elements may be crucial for defining disease risk and for the prognostic stratification of patients, in genetic disorders as well as in cancer. Their functional interpretation is challenging because genome-wide enhancer-target gene (ETG) pairing is an open problem in genomics. The solutions proposed so far do not account for the hierarchy of structural domains that defines the three-dimensional (3D) architecture of chromatin. Here we introduce a change of perspective based on the definition of multi-scale structural chromatin domains, integrated in a statistical framework to define ETG pairs. In this work, (i) we develop a computational and statistical framework to reconstruct a comprehensive map of ETG pairs leveraging functional genomics data; (ii) we demonstrate that incorporating chromatin 3D architecture information improves ETG pairing accuracy; and (iii) we use multiple experimental datasets to extensively benchmark our method against previous solutions for the genome-wide reconstruction of ETG pairs. This solution will facilitate the annotation and interpretation of sequence variants in distal non-coding regulatory elements. We expect it to be especially helpful in clinically oriented applications of whole-genome sequencing in research on cancer and undiagnosed genetic diseases.
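
As a rough illustration of how a hierarchy of structural domains could constrain ETG pairing, the sketch below lists enhancer-gene candidates that share a domain at any scale and records the smallest shared domain for each. It is not the method published in the article: the abstract does not specify the algorithm, and the statistical scoring step is omitted here. All names (`Domain`, `candidate_pairs`, the `level` index) and all coordinates are hypothetical toy values.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    """A structural chromatin domain (hypothetical representation)."""
    chrom: str
    start: int  # inclusive
    end: int    # exclusive
    level: int  # assumed scale index; 0 = finest scale


def contains(domain, chrom, pos):
    """True if the position lies inside the domain on the same chromosome."""
    return domain.chrom == chrom and domain.start <= pos < domain.end


def candidate_pairs(enhancers, genes, domains):
    """Map (enhancer, gene) -> smallest domain containing both.

    enhancers and genes are dicts of name -> (chrom, position); for genes
    the position stands for the transcription start site. Pairs that share
    no domain at any scale are left out.
    """
    pairs = {}
    for e_name, (e_chrom, e_pos) in enhancers.items():
        for g_name, (g_chrom, g_pos) in genes.items():
            shared = [
                d for d in domains
                if contains(d, e_chrom, e_pos) and contains(d, g_chrom, g_pos)
            ]
            if shared:
                # Keep the tightest shared domain as the pairing context.
                pairs[(e_name, g_name)] = min(shared, key=lambda d: d.end - d.start)
    return pairs


# Toy two-level hierarchy: one large domain split into two sub-domains.
domains = [
    Domain("chr1", 0, 1000, 1),
    Domain("chr1", 0, 400, 0),
    Domain("chr1", 400, 1000, 0),
]
enhancers = {"enhA": ("chr1", 100), "enhB": ("chr1", 500)}
genes = {"gene1": ("chr1", 300), "gene2": ("chr1", 900), "gene3": ("chr2", 300)}

pairs = candidate_pairs(enhancers, genes, domains)
for (enh, gene), dom in sorted(pairs.items()):
    print(enh, gene, "level", dom.level)
```

In a fuller pipeline, each candidate pair would then be scored against functional genomics data (for example, the correlation between enhancer activity and gene expression across samples), with the scale of the shared domain available as context for that test.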

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Cell Line
  • Cell Line, Tumor
  • Cells, Cultured
  • Chromatin / genetics*
  • Chromatin / metabolism
  • Computational Biology / methods*
  • Enhancer Elements, Genetic / genetics*
  • Epistasis, Genetic
  • Gene Expression Profiling / methods
  • Gene Expression Regulation*
  • Genome-Wide Association Study / methods
  • Genomics / methods
  • Humans
  • Neoplasms / genetics
  • Neoplasms / pathology
  • Polymorphism, Single Nucleotide
  • Promoter Regions, Genetic / genetics
  • Quantitative Trait Loci / genetics

Substances

  • Chromatin