Practical FDR-based sample size calculations in microarray experiments

Jianhua Hu; Fei Zou; Fred A Wright

doi:10.1093/bioinformatics/bti519

Practical FDR-based sample size calculations in microarray experiments

Bioinformatics. 2005 Aug 1;21(15):3264-72. doi: 10.1093/bioinformatics/bti519. Epub 2005 Jun 2.

Authors

Jianhua Hu¹, Fei Zou, Fred A Wright

Affiliation

¹ Department of Biostatistics and Applied Mathematics, University of Texas M.D. Anderson Cancer Center, TX 77030-4009, USA. [email protected]

PMID: 15932903
DOI: 10.1093/bioinformatics/bti519

Abstract

Motivation: Owing to the experimental cost and difficulty in obtaining biological materials, it is essential to consider appropriate sample sizes in microarray studies. With the growing use of the False Discovery Rate (FDR) in microarray analysis, an FDR-based sample size calculation is essential.

Method: We describe an approach to explicitly connect the sample size to the FDR and the number of differentially expressed genes to be detected. The method fits parametric models for degree of differential expression using the Expectation-Maximization algorithm.

Results: The applicability of the method is illustrated with simulations and studies of a lung microarray dataset. We propose to use a small training set or published data from relevant biological settings to calculate the sample size of an experiment.

Availability: Code to implement the method in the statistical package R is available from the authors.

Publication types

Evaluation Study
Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Algorithms*
Data Interpretation, Statistical*
False Positive Reactions
Gene Expression Profiling / methods*
Models, Genetic*
Models, Statistical
Oligonucleotide Array Sequence Analysis / methods*
Sample Size
Software*

Grants and funding

3 P30 HD003110/HD/NICHD NIH HHS/United States