[Gene-based principal component logistic regression model and its application on genome-wide association study]

Zhonghua Liu Xing Bing Xue Za Zhi. 2012 Jun;33(6):622-5.
[Article in Chinese]

Abstract

To explore the gene-based principal component logistic regression model and its application in genome-wide association study. Using the simulated genome-wide single nucleotide polymorphism (SNPs) genotypes data, we proposed a practical statistical analysis strategy-'the principal component logistic regression model', based on the gene levels to assess the association between genetic variations and complex diseases. The simulation results showed that the P value of genes in related diseases was the smallest among the results from all the genes. The results of simulation indicated that not only it could reduce the degree of freedom through hypothesis testing but could also better understand the correlations between SNPs. The gene-based principal component logistic regression model seemed to have certain statistical power for testing the association between genetic genes and diseases in the genome-wide association studies.

Publication types

  • English Abstract
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Simulation
  • Genome-Wide Association Study*
  • Humans
  • Logistic Models*
  • Polymorphism, Single Nucleotide
  • Principal Component Analysis