Determination of the differentially expressed genes in microarray experiments using local FDR

BMC Bioinformatics. 2004 Sep 6:5:125. doi: 10.1186/1471-2105-5-125.

Abstract

Background: Thousands of genes in a genomewide data set are tested against some null hypothesis, for detecting differentially expressed genes in microarray experiments. The expected proportion of false positive genes in a set of genes, called the False Discovery Rate (FDR), has been proposed to measure the statistical significance of this set. Various procedures exist for controlling the FDR. However the threshold (generally 5%) is arbitrary and a specific measure associated with each gene would be worthwhile.

Results: Using process intensity estimation methods, we define and give estimates of the local FDR, which may be considered as the probability for a gene to be a false positive. After a global assessment rule controlling the false positive error, the local FDR is a valuable guideline for deciding wether a gene is differentially expressed. The interest of the method is illustrated on three well known data sets. A R routine for computing local FDR estimates from p-values is available at http://www.inapg.fr/ens_rech/mathinfo/recherche/mathematique/outil.html.

Conclusions: The local FDR associated with each gene measures the probability that it is a false positive. It gives the opportunity to compute the FDR of any given group of clones (of the same gene) or genes pertaining to the same regulation network or the same chromosomic region.

MeSH terms

  • Acute Disease
  • Animals
  • Apolipoprotein A-I / genetics
  • Breast Neoplasms / genetics
  • Cholesterol, HDL / blood
  • Cholesterol, HDL / genetics
  • Data Interpretation, Statistical
  • Gene Expression Profiling / statistics & numerical data*
  • Gene Expression Regulation / genetics*
  • Gene Expression Regulation, Neoplastic / genetics*
  • Humans
  • Leukemia, Myeloid / genetics
  • Mice
  • Mice, Knockout
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma / genetics
  • Research Design / statistics & numerical data

Substances

  • Apolipoprotein A-I
  • Cholesterol, HDL