Kernel machine SNP-set analysis for censored survival outcomes in genome-wide association studies

Genet Epidemiol. 2011 Nov;35(7):620-31. doi: 10.1002/gepi.20610. Epub 2011 Aug 4.

Abstract

In this article, we develop a powerful test for identifying single nucleotide polymorphism (SNP)-sets that are predictive of survival with data from genome-wide association studies. We first group typed SNPs into SNP-sets based on genomic features and then apply a score test to assess the overall effect of each SNP-set on the survival outcome through a kernel machine Cox regression framework. This approach uses genetic information from all SNPs in the SNP-set simultaneously and accounts for linkage disequilibrium (LD), leading to a powerful test with reduced degrees of freedom when the typed SNPs are in LD with each other. This type of test also has the advantage of capturing the potentially nonlinear effects of the SNPs, SNP-SNP interactions (epistasis), and the joint effects of multiple causal variants. By simulating SNP data based on the LD structure of real genes from the HapMap project, we demonstrate that our proposed test is more powerful than the standard single-SNP minimum P-value test for association studies with censored survival outcomes. We illustrate the proposed test with a real-data application.
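
The idea of a SNP-set score test can be illustrated with a minimal sketch. The abstract does not specify the kernel, the covariates in the null model, or how the null distribution of the statistic is obtained, so everything below is an assumption made for illustration: a linear kernel K = GG^T on the genotype matrix, a null Cox model with no covariates (so the martingale residuals reduce to event indicators minus Nelson-Aalen cumulative hazards), a quadratic-form statistic Q = r^T K r, and a permutation P-value used as a simple stand-in for whatever calibration the article uses. All function names are hypothetical.

```python
import numpy as np

def null_martingale_residuals(time, event):
    """Martingale residuals under a null model with no covariates (assumption).

    r_i = event_i - H(time_i), where H is the Nelson-Aalen cumulative hazard
    (tied event times are pooled into a single increment d / n_at_risk).
    """
    time = np.asarray(time, dtype=float)
    event = np.asarray(event, dtype=int)
    event_times = np.unique(time[event == 1])
    at_risk = np.array([(time >= s).sum() for s in event_times])
    deaths = np.array([((time == s) & (event == 1)).sum() for s in event_times])
    increments = deaths / at_risk
    cumhaz = np.array([increments[event_times <= t].sum() for t in time])
    return event - cumhaz

def linear_kernel(G):
    """Linear kernel on an n x p genotype matrix (one assumed kernel choice)."""
    G = np.asarray(G, dtype=float)
    return G @ G.T

def kernel_score_statistic(resid, K):
    """Quadratic-form statistic Q = r^T K r for the whole SNP-set."""
    return float(resid @ K @ resid)

def permutation_pvalue(resid, K, n_perm=999, seed=0):
    """Permutation P-value for Q (a stand-in, not the article's procedure)."""
    rng = np.random.default_rng(seed)
    q_obs = kernel_score_statistic(resid, K)
    exceed = 0
    for _ in range(n_perm):
        perm = rng.permutation(len(resid))
        if kernel_score_statistic(resid[perm], K) >= q_obs:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)

if __name__ == "__main__":
    # Toy data: 50 subjects, 5 SNPs coded 0/1/2, random censoring (all simulated).
    rng = np.random.default_rng(42)
    G = rng.integers(0, 3, size=(50, 5))
    time = rng.exponential(scale=1.0, size=50)
    event = rng.integers(0, 2, size=50)
    r = null_martingale_residuals(time, event)
    K = linear_kernel(G)
    print("Q =", kernel_score_statistic(r, K))
    print("permutation P-value =", permutation_pvalue(r, K, n_perm=199))
```

Because a single statistic is computed per SNP-set, one test is performed for each set rather than one per SNP. Swapping `linear_kernel` for a nonlinear kernel is how a sketch like this would accommodate nonlinear effects and SNP-SNP interactions.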

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Epistasis, Genetic
  • Genome-Wide Association Study / statistics & numerical data*
  • HapMap Project
  • Humans
  • Linkage Disequilibrium
  • Lung Neoplasms / genetics*
  • Lung Neoplasms / mortality*
  • Models, Genetic*
  • Polymorphism, Single Nucleotide*