Resampling to Address the Winner's Curse in Genetic Association Analysis of Time to Event

Genet Epidemiol. 2015 Nov;39(7):518-28. doi: 10.1002/gepi.21920. Epub 2015 Sep 28.

Abstract

The "winner's curse" is a subtle and difficult problem in interpretation of genetic association, in which association estimates from large-scale gene detection studies are larger in magnitude than those from subsequent replication studies. This is practically important because use of a biased estimate from the original study will yield an underestimate of sample size requirements for replication, leaving the investigators with an underpowered study. Motivated by investigation of the genetics of type 1 diabetes complications in a longitudinal cohort of participants in the Diabetes Control and Complications Trial/Epidemiology of Diabetes Interventions and Complications (DCCT/EDIC) Genetics Study, we apply a bootstrap resampling method in analysis of time to nephropathy under a Cox proportional hazards model, examining 1,213 single-nucleotide polymorphisms (SNPs) in 201 candidate genes custom genotyped in 1,361 white probands. Among 15 top-ranked SNPs, bias reduction in log hazard ratio estimates ranges from 43.1% to 80.5%. In simulation studies based on the observed DCCT/EDIC genotype data, genome-wide bootstrap estimates for false-positive SNPs and for true-positive SNPs with low-to-moderate power are closer to the true values than uncorrected naïve estimates, but tend to overcorrect SNPs with high power. This bias-reduction technique is generally applicable for complex trait studies including quantitative, binary, and time-to-event traits.

Keywords: DCCT/EDIC Genetics Study; bootstrap; cohort studies; genotype; phenotype; selection bias; survival analysis.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bias
  • Diabetes Mellitus, Type 1 / complications
  • Diabetes Mellitus, Type 1 / epidemiology
  • Diabetes Mellitus, Type 1 / genetics
  • Diabetes Mellitus, Type 1 / therapy
  • False Positive Reactions
  • Female
  • Genome-Wide Association Study / methods*
  • Genotype
  • Humans
  • Kidney Diseases / complications
  • Kidney Diseases / genetics
  • Kidney Diseases / pathology
  • Male
  • Models, Genetic
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics
  • Proportional Hazards Models
  • Risk
  • Sample Size
  • Time Factors