Clustering analysis of SAGE data using a Poisson approach

Genome Biol. 2004;5(7):R51. doi: 10.1186/gb-2004-5-7-r51. Epub 2004 Jun 29.

Abstract

Serial analysis of gene expression (SAGE) data have been poorly exploited by clustering analysis owing to the lack of appropriate statistical methods that consider their specific properties. We modeled SAGE data by Poisson statistics and developed two Poisson-based distances. Their application to simulated and experimental mouse retina data show that the Poisson-based distances are more appropriate and reliable for analyzing SAGE data compared to other commonly used distances or similarity measures such as Pearson correlation or Euclidean distance.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.
  • Validation Study

MeSH terms

  • Animals
  • Animals, Newborn / genetics
  • Chromosome Mapping / statistics & numerical data
  • Cluster Analysis
  • Computer Simulation / statistics & numerical data
  • Gene Expression Profiling / statistics & numerical data*
  • Gene Library
  • Mice
  • Models, Statistical
  • Poisson Distribution
  • Retina / chemistry
  • Retina / embryology
  • Retina / metabolism
  • Sensitivity and Specificity