Coexpression analysis of human genes across many microarray data sets

Genome Res. 2004 Jun;14(6):1085-94. doi: 10.1101/gr.1910904.

Abstract

We present a large-scale analysis of mRNA coexpression based on 60 large human data sets containing a total of 3924 microarrays. We sought pairs of genes that were reliably coexpressed (based on the correlation of their expression profiles) in multiple data sets, establishing a high-confidence network of 8805 genes connected by 220,649 "coexpression links" that are observed in at least three data sets. Confirmed positive correlations between genes were much more common than confirmed negative correlations. We show that confirmation of coexpression in multiple data sets is correlated with functional relatedness, and show how cluster analysis of the network can reveal functionally coherent groups of genes. Our findings demonstrate how the large body of accumulated microarray data can be exploited to increase the reliability of inferences about gene function.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Gene Expression Profiling / statistics & numerical data*
  • Gene Expression Regulation / genetics*
  • Genes / genetics*
  • Genetic Linkage / genetics
  • Humans
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*