Prior biological knowledge-based approaches for the analysis of genome-wide expression profiles using gene sets and pathways

Stat Methods Med Res. 2009 Dec;18(6):577-93. doi: 10.1177/0962280209351925.

Abstract

An increasing challenge in analysis of microarray data is how to interpret and gain biological insight of profiles of thousands of genes. This article provides a review of statistical methods for analysis of microarray data by incorporating prior biological knowledge using gene sets and biological pathways, which consist of groups of biologically similar genes. We first discuss issues of individual gene analysis. We compare several methods for analysis of gene sets including over-representation anlaysis, gene set enrichment analysis, principal component analysis, global test and kernel machine. We discuss the assumptions of these methods and their pros and cons. We illustrate these methods by application to a type II diabetes data set.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Computational Biology / methods*
  • Diabetes Mellitus, Type 2 / genetics
  • Gene Expression Profiling / methods
  • Gene Expression Profiling / statistics & numerical data*
  • Genome*
  • Humans
  • Knowledge Bases*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods