AdaLiftOver: high-resolution identification of orthologous regulatory elements with Adaptive liftOver

Bioinformatics. 2023 Apr 3;39(4):btad149. doi: 10.1093/bioinformatics/btad149.

Abstract

Motivation: Elucidating functionally similar orthologous regulatory regions for human and model organism genomes is critical for exploiting model organism research and advancing our understanding of results from genome-wide association studies (GWAS). Sequence conservation is the de facto approach for finding orthologous non-coding regions between human and model organism genomes. However, existing methods for mapping non-coding genomic regions across species are challenged by the multi-mapping, low precision, and low mapping rate issues.

Results: We develop Adaptive liftOver (AdaLiftOver), a large-scale computational tool for identifying functionally similar orthologous non-coding regions across species. AdaLiftOver builds on the UCSC liftOver framework to extend the query regions and prioritizes the resulting candidate target regions based on the conservation of the epigenomic and the sequence grammar features. Evaluations of AdaLiftOver with multiple case studies, spanning both genomic intervals from epigenome datasets across a wide range of model organisms and GWAS SNPs, yield AdaLiftOver as a versatile method for deriving hard-to-obtain human epigenome datasets as well as reliably identifying orthologous loci for GWAS SNPs.

Availability and implementation: The R package and the data for AdaLiftOver is available from https://github.com/keleslab/AdaLiftOver.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Genome
  • Genome-Wide Association Study*
  • Genomics / methods
  • Humans
  • Regulatory Sequences, Nucleic Acid*
  • Software