Estimation of significance thresholds for genomewide association scans

Frank Dudbridge; Arief Gusnanto

doi:10.1002/gepi.20297

Genet Epidemiol. 2008 Apr;32(3):227-34. doi: 10.1002/gepi.20297.

Authors

Frank Dudbridge¹, Arief Gusnanto

Affiliation

¹ MRC Biostatistics Unit, Institute for Public Health, Cambridge, United Kingdom.

Abstract

The question of what significance threshold is appropriate for genomewide association studies is somewhat unresolved. Previous theoretical suggestions have yet to be validated in practice, whereas permutation testing does not resolve a discrepancy between the genomewide multiplicity of the experiment and the subset of markers actually tested. We used genotypes from the Wellcome Trust Case-Control Consortium to estimate a genomewide significance threshold for the UK Caucasian population. We subsampled the genotypes at increasing densities, using permutation to estimate the nominal P-value for 5% family-wise error. By extrapolating to infinite density, we estimated the genomewide significance threshold to be about 7.2 × 10⁻⁸. To reduce the computation time, we considered Patterson's eigenvalue estimator of the effective number of tests, but found it to be an order of magnitude too low for multiplicity correction. However, by fitting a Beta distribution to the minimum P-value from permutation replicates, we showed that the effective number is a useful heuristic and suggest that its estimation in this context is an open problem. We conclude that permutation is still needed to obtain genomewide significance thresholds, but with subsampling, extrapolation and estimation of an effective number of tests, the threshold can be standardized for all studies of the same population.
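
The link between minimum P-values from permutation replicates and an effective number of tests can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the minimum of M independent uniform P-values follows a Beta(1, M) distribution, fixes the first Beta parameter at 1, and estimates the second by maximum likelihood. The function names and the simulated data are hypothetical, and the abstract does not state how the Beta fit was actually carried out.

```python
import math
import random


def effective_number_of_tests(min_pvalues):
    """Maximum-likelihood estimate of b in Beta(1, b) from minimum P-values.

    Assumption: the first Beta parameter is fixed at 1, so the estimate is
    b = -n / sum(log(1 - p_i)).
    """
    log_sum = sum(math.log1p(-p) for p in min_pvalues)
    return -len(min_pvalues) / log_sum


def fwer_threshold(m_eff, alpha=0.05):
    """Nominal P-value giving family-wise error alpha for m_eff independent tests.

    Computes 1 - (1 - alpha) ** (1 / m_eff) in a numerically stable way.
    """
    return -math.expm1(math.log1p(-alpha) / m_eff)


def simulate_min_pvalues(n_tests, n_replicates, seed=0):
    """Hypothetical stand-in for permutation replicates.

    Draws the minimum of n_tests independent uniform P-values directly
    from its inverse CDF, once per replicate.
    """
    rng = random.Random(seed)
    return [
        -math.expm1(math.log1p(-rng.random()) / n_tests)
        for _ in range(n_replicates)
    ]


if __name__ == "__main__":
    min_p = simulate_min_pvalues(n_tests=1000, n_replicates=5000, seed=1)
    m_eff = effective_number_of_tests(min_p)
    print("estimated effective number of tests:", m_eff)
    print("nominal threshold for 5% family-wise error:", fwer_threshold(m_eff))
```

With real data, the simulated list would be replaced by the smallest P-value observed in each permutation of the phenotype labels. Correlation between markers would then be expected to pull the estimated effective number below the number of markers tested.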

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Case-Control Studies
Genetic Markers
Genetic Predisposition to Disease / genetics*
Genome, Human / genetics*
Genotype
Humans
United Kingdom
White People / genetics

Substances

Genetic Markers
