T-Gene: improved target gene prediction

Bioinformatics. 2020 Jun 1;36(12):3902-3904. doi: 10.1093/bioinformatics/btaa227.

Abstract

Motivation: Identifying the genes regulated by a given transcription factor (TF) (its 'target genes') is a key step in developing a comprehensive understanding of gene regulation. Previously, we developed a method (CisMapper) for predicting the target genes of a TF based solely on the correlation between a histone modification at the TF's binding site and the expression of the gene across a set of tissues or cell lines. That approach is limited to organisms for which extensive histone and expression data are available, and does not explicitly incorporate the genomic distance between the TF and the gene.
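As a rough illustration of the correlation idea described above, the following minimal sketch computes a Pearson correlation between a histone-modification signal at one binding site and the expression of one gene, measured across the same set of tissues. The function name `pearson` and the toy values are assumptions made for illustration; the abstract does not specify the exact correlation statistic or data format that CisMapper uses.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # A constant signal carries no information; treat as uncorrelated.
        return 0.0
    return cov / sqrt(var_x * var_y)

# Hypothetical toy data: one value per tissue, same tissue order in both lists.
histone_signal = [1.0, 2.0, 3.0, 4.0]   # histone mark level at the TF binding site
expression = [2.0, 4.0, 6.0, 8.0]       # expression level of the candidate gene

r = pearson(histone_signal, expression)
```

A correlation near 1 or -1 across tissues would suggest that the site and the gene vary together, which is the kind of evidence the approach relies on; a real analysis would also need a significance estimate.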

Results: We present the T-Gene algorithm, which overcomes these limitations. It can be used to predict which genes are most likely to be regulated by a TF, and which of the TF's binding sites are most likely involved in regulating particular genes. T-Gene calculates a novel score that combines distance and histone/expression correlation, and we show that this score accurately predicts when a regulatory element bound by a TF is in contact with a gene's promoter, achieving median precision above 60%. T-Gene is easy to use via its web server or as a command-line tool, and can also make accurate predictions (median precision above 40%) based on distance alone when extensive histone/expression data are not available for the organism. T-Gene provides an estimate of the statistical significance of each of its predictions.
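The abstract does not give the formula for the combined score, so the sketch below is only a hypothetical stand-in showing how distance and correlation evidence might be merged, with a fallback to distance alone. The exponential decay, the `scale_bp` value, and the names `distance_weight` and `link_score` are all assumptions made for illustration and are not T-Gene's actual method.

```python
from math import exp

def distance_weight(distance_bp, scale_bp=50_000.0):
    """Hypothetical weight that decays as the site-to-gene distance grows."""
    return exp(-abs(distance_bp) / scale_bp)

def link_score(distance_bp, correlation=None, scale_bp=50_000.0):
    """Illustrative score for a (binding site, gene) link.

    With a correlation value, the score is |correlation| times the
    distance weight. Without one, it falls back to distance alone.
    This is an assumed combination rule, not the published one.
    """
    weight = distance_weight(distance_bp, scale_bp)
    if correlation is None:
        return weight
    return abs(correlation) * weight

# Hypothetical candidate links: (gene name, distance in bp, correlation).
candidates = [
    ("geneA", 5_000, 0.2),
    ("geneB", 20_000, 0.9),
    ("geneC", 400_000, 0.95),
]

# Rank using both sources of evidence.
ranked = sorted(candidates, key=lambda c: link_score(c[1], c[2]), reverse=True)

# Rank using distance only, as when no histone/expression data exist.
ranked_distance_only = sorted(candidates, key=lambda c: link_score(c[1]), reverse=True)
```

In this toy example the combined ranking favors the strongly correlated gene at moderate distance, while the distance-only ranking simply prefers the nearest gene. Either ranking would still need a significance estimate before being read as a prediction.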

Availability and implementation: The T-Gene web server, source code, histone/expression data and genome annotation files are provided at http://meme-suite.org.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Binding Sites
  • Chromatin Immunoprecipitation
  • Gene Expression Regulation
  • Software*
  • Transcription Factors / genetics
  • Transcription Factors / metabolism

Substances

  • Transcription Factors