Expresso: A database and web server for exploring the interaction of transcription factors and their target genes in Arabidopsis thaliana using ChIP-Seq peak data

Delasa Aghamirzaie; Karthik Raja Velmurugan; Shuchi Wu; Doaa Altarawy; Lenwood S Heath; Ruth Grene

doi:10.12688/f1000research.10041.1

F1000Res. 2017 Mar 28:6:372. doi: 10.12688/f1000research.10041.1. eCollection 2017.

Authors

Delasa Aghamirzaie¹, Karthik Raja Velmurugan¹,², Shuchi Wu³, Doaa Altarawy⁴, Lenwood S Heath⁴, Ruth Grene⁵

Affiliations

¹ Genetics, Bioinformatics, and Computational Biology (GBCB), Virginia Tech, Blacksburg, VA, 24061, USA.
² Center for Bioinformatics and Genetics and the Primary Care Research Network, Edward Via College of Osteopathic Medicine, Blacksburg, VA, 24060, USA.
³ Department of Horticulture, Virginia Tech, Blacksburg, VA, 24061, USA.
⁴ Department of Computer Science, Virginia Tech, Blacksburg, VA, 24061, USA.
⁵ Department of Plant Pathology, Physiology, and Weed Science, Virginia Tech, Blacksburg, VA, 24061, USA.

Abstract

Motivation: The increasing availability of chromatin immunoprecipitation sequencing (ChIP-Seq) data enables us to learn more about the action of transcription factors in the regulation of gene expression. Even though in vivo transcriptional regulation often involves the concerted action of more than one transcription factor, the format of each individual ChIP-Seq dataset usually represents the action of a single transcription factor. Therefore, a relational database in which available ChIP-Seq datasets are curated is essential.

Results: We present Expresso (database and webserver) as a tool for the collection and integration of available Arabidopsis ChIP-Seq peak data, which in turn can be linked to a user's gene expression data. Known target genes of transcription factors were identified by motif analysis of publicly available GEO ChIP-Seq datasets. Expresso currently provides three services: 1) Identification of target genes of a given transcription factor; 2) Identification of transcription factors that regulate a gene of interest; 3) Computation of correlation between the gene expression of transcription factors and their target genes.

Availability: Expresso is freely available at http://bioinformatics.cs.vt.edu/expresso/.
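The three services named in the abstract can be pictured with a small, self-contained sketch. It is illustrative only and does not reproduce Expresso's code or data: the gene and transcription factor names, the expression values, and the function names are all hypothetical, and the choice of Pearson correlation is an assumption, since the abstract says only "correlation".

```python
from math import sqrt

# Hypothetical TF -> target-gene map (made-up names, not Expresso data).
TF_TARGETS = {
    "TF_A": {"GENE_1", "GENE_2"},
    "TF_B": {"GENE_2", "GENE_3"},
}

# Hypothetical expression values across four samples.
EXPRESSION = {
    "TF_A": [1.0, 2.0, 3.0, 4.0],
    "GENE_1": [2.0, 4.0, 6.0, 8.0],
    "GENE_2": [4.0, 3.0, 2.0, 1.0],
}


def targets_of(tf):
    """Service 1: target genes of a given transcription factor."""
    return sorted(TF_TARGETS.get(tf, set()))


def regulators_of(gene):
    """Service 2: transcription factors that regulate a gene of interest."""
    return sorted(tf for tf, genes in TF_TARGETS.items() if gene in genes)


def pearson(xs, ys):
    """Pearson correlation of two equal-length expression profiles."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("profiles must have equal length of at least 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")  # correlation is undefined for a flat profile
    return cov / sqrt(var_x * var_y)


def tf_target_correlations(tf, expression):
    """Service 3: correlate a TF's expression with each of its targets.

    Targets without expression data are skipped.
    """
    if tf not in expression:
        return {}
    return {
        gene: pearson(expression[tf], expression[gene])
        for gene in targets_of(tf)
        if gene in expression
    }


correlations = tf_target_correlations("TF_A", EXPRESSION)
```

In this toy data, `GENE_1` rises with `TF_A` while `GENE_2` falls, so the two correlations have opposite signs, the kind of contrast a user might look for when linking their own expression data to ChIP-Seq-derived targets.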

Keywords: ChIP-Seq; gene regulation; transcription factor; transcriptional regulation.

Grants and funding

This project was supported by the National Science Foundation [NSF-MCB-1052145 and NSF-ABI-1062472].