GEO2Enrichr: browser extension and server app to extract gene sets from GEO and analyze them for biological functions

Bioinformatics. 2015 Sep 15;31(18):3060-2. doi: 10.1093/bioinformatics/btv297. Epub 2015 May 13.

Abstract

Motivation: Identification of differentially expressed genes is an important step in extracting knowledge from gene expression profiling studies. The raw expression data from microarray and other high-throughput technologies are deposited into the Gene Expression Omnibus (GEO) and served as Simple Omnibus Format in Text (SOFT) files. However, extracting and analyzing differentially expressed genes from GEO requires significant computational skills.

Results: Here we introduce GEO2Enrichr, a browser extension for extracting differentially expressed gene sets from GEO and analyzing those sets with Enrichr, an independent gene set enrichment analysis tool containing over 70 000 annotated gene sets organized into 75 gene-set libraries. GEO2Enrichr adds JavaScript code to GEO web pages; this code scrapes user-selected accession numbers and metadata. With one click, users can then submit this information to a web-server application that downloads the SOFT files, parses, cleans and normalizes the data, identifies the differentially expressed genes, and pipes the resulting gene lists to Enrichr for downstream functional analysis. GEO2Enrichr opens a new avenue for adding functionality to major bioinformatics resources such as GEO by integrating tools and resources without the need for a plug-in architecture. Importantly, GEO2Enrichr helps researchers quickly explore hypotheses with little technical overhead, lowering the barrier to entry for biologists by automating the data-processing steps needed for knowledge extraction from the major repository GEO.
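The server-side steps described above (parse a SOFT file, clean and transform the values, rank genes by differential expression) can be illustrated with a minimal Python sketch. It is not the GEO2Enrichr implementation. It assumes a GDS-style SOFT layout in which the expression table sits between `!dataset_table_begin` and `!dataset_table_end` with `ID_REF` and `IDENTIFIER` columns followed by one column per sample. It substitutes a log2 transform for normalization and a plain Welch t-statistic for the differential expression method, neither of which is specified in this abstract. The accession, sample names, gene names and values are made up, and all function names are hypothetical.

```python
import math
from statistics import mean, variance

# Toy SOFT-like text; accession, samples, genes and values are invented.
SOFT_TEXT = """\
^DATASET = GDS0000
!dataset_title = toy example (made-up values)
!dataset_table_begin
ID_REF\tIDENTIFIER\tGSM1\tGSM2\tGSM3\tGSM4
p1\tGeneA\t100\t120\t400\t440
p2\tGeneB\t200\t210\t205\t195
p3\tGeneC\t800\t760\t100\t90
p4\tGeneD\tnull\t50\t55\t60
!dataset_table_end
"""


def parse_soft_table(text):
    """Return (sample_ids, {gene: [values]}) from the table section.

    Rows containing any non-numeric entry are dropped (the cleaning step).
    """
    in_table = False
    header = None
    rows = {}
    for line in text.splitlines():
        if line.startswith("!dataset_table_begin"):
            in_table = True
            continue
        if line.startswith("!dataset_table_end"):
            break
        if not in_table:
            continue
        fields = line.split("\t")
        if header is None:
            header = fields  # first table line holds the column names
            continue
        try:
            values = [float(v) for v in fields[2:]]
        except ValueError:
            continue  # skip rows with missing or non-numeric values
        rows[fields[1]] = values
    return header[2:], rows


def log2_transform(rows):
    """Log2-transform raw intensities (pseudocount of 1 avoids log of zero)."""
    return {g: [math.log2(v + 1.0) for v in vals] for g, vals in rows.items()}


def welch_t(a, b):
    """Welch t-statistic for treated (b) versus control (a)."""
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(b) - mean(a)) / se if se > 0 else 0.0


def differential_genes(text, control, treated, top_n=1):
    """Return (up, down) gene lists ranked by the Welch t-statistic."""
    samples, rows = parse_soft_table(text)
    rows = log2_transform(rows)
    idx = {s: i for i, s in enumerate(samples)}
    scores = {
        gene: welch_t([vals[idx[s]] for s in control],
                      [vals[idx[s]] for s in treated])
        for gene, vals in rows.items()
    }
    ranked = sorted(scores, key=scores.get)  # most negative first
    return ranked[::-1][:top_n], ranked[:top_n]


up, down = differential_genes(SOFT_TEXT, ["GSM1", "GSM2"], ["GSM3", "GSM4"])
print("up:", up, "down:", down)
```

In a pipeline of the kind described, the resulting `up` and `down` lists would be the gene sets handed to an enrichment tool; that submission step is omitted here because it requires network access.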

Availability and implementation: GEO2Enrichr is an open source tool, freely available for installation as a browser extension from the Chrome Web Store and Firefox Add-ons. Documentation and a browser-independent web application can be found at http://amp.pharm.mssm.edu/g2e/.

Contact: [email protected].

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • 3T3 Cells
  • Animals
  • Computational Biology / methods*
  • Databases, Genetic*
  • Electronic Data Processing
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation
  • Gene Library
  • Internet
  • Mice
  • Microarray Analysis / methods*
  • TRPV Cation Channels / physiology*
  • User-Computer Interface

Substances

  • TRPV Cation Channels
  • Trpv4 protein, mouse