Base-pair resolution detection of transcription factor binding site by deep deconvolutional network

Bioinformatics. 2018 Oct 15;34(20):3446-3453. doi: 10.1093/bioinformatics/bty383.

Abstract

Motivation: Transcription factor (TF) binds to the promoter region of a gene to control gene expression. Identifying precise TF binding sites (TFBSs) is essential for understanding the detailed mechanisms of TF-mediated gene regulation. However, there is a shortage of computational approach that can deliver single base pair resolution prediction of TFBS.

Results: In this paper, we propose DeepSNR, a Deep Learning algorithm for predicting TF binding location at Single Nucleotide Resolution de novo from DNA sequence. DeepSNR adopts a novel deconvolutional network (deconvNet) model and is inspired by the similarity to image segmentation by deconvNet. The proposed deconvNet architecture is constructed on top of 'DeepBind' and we trained the entire model using TF-specific data from ChIP-exonuclease (ChIP-exo) experiments. DeepSNR has been shown to outperform motif search-based methods for several evaluation metrics. We have also demonstrated the usefulness of DeepSNR in the regulatory analysis of TFBS as well as in improving the TFBS prediction specificity using ChIP-seq data.

Availability and implementation: DeepSNR is available open source in the GitHub repository (https://github.com/sirajulsalekin/DeepSNR).

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Base Pairing
  • Binding Sites
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Protein Binding
  • Sequence Analysis, DNA / methods*
  • Software
  • Transcription Factors / chemistry
  • Transcription Factors / metabolism*

Substances

  • Transcription Factors