Bioinformatics approaches to predict target genes from transcription factor binding data

Methods. 2017 Dec 1:131:111-119. doi: 10.1016/j.ymeth.2017.09.001. Epub 2017 Sep 7.

Abstract

Transcription factors regulate gene expression and play an essential role in development by maintaining proliferative states, driving cellular differentiation and determining cell fate. Transcription factors are capable of regulating multiple genes over potentially long distances making target gene identification challenging. Currently available experimental approaches to detect distal interactions have multiple weaknesses that have motivated the development of computational approaches. Although an improvement over experimental approaches, existing computational approaches are still limited in their application, with different weaknesses depending on the approach. Here, we review computational approaches with a focus on data dependency, cell type specificity and usability. With the aim of identifying transcription factor target genes, we apply available approaches to typical transcription factor experimental datasets. We show that approaches are not always capable of annotating all transcription factor binding sites; binding sites should be treated disparately; and a combination of approaches can increase the biological relevance of the set of genes identified as targets.

Keywords: Bioinformatics; Cancer; Enhancer; Promoter; Regulatory interactions; Transcription factor.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Binding Sites / genetics
  • Chromatin Immunoprecipitation / methods
  • Computational Biology / methods*
  • Datasets as Topic
  • Gene Expression Regulation / genetics*
  • Humans
  • Promoter Regions, Genetic / genetics*
  • Protein Binding / genetics
  • Sequence Analysis, DNA / methods
  • Software
  • Transcription Factors / genetics*
  • Transcription, Genetic / genetics*

Substances

  • Transcription Factors