Statistical methods for detecting genetic interactions: a head and neck squamous-cell cancer study

J Toxicol Environ Health A. 2008;71(11-12):803-15. doi: 10.1080/15287390801985745.

Abstract

Tobacco smoke and occupational exposures to chemicals such as polycyclic aromatic hydrocarbons (PAHs) are, aside from alcohol, the major risk factors for development of head and neck squamous-cell cancer (HNSCC). In this study, new statistical methods were applied. We employ new statistical methods to detect genetic interactions perhaps of higher order, that might play a role in developing HNSCC. The underlying study comprises 312 HNSCC cases and 300 controls. Single-nucleotide polymorphisms (SNPs) of PAH metabolizing and repair enzymes, somatic p53 mutations, and tobacco smoke were examined. Key statistical tools for our analysis are methods of unsupervised and supervised learning. In unsupervised learning, one performs cluster analyses based on well-known and new distance measures to find differences in the SNP patterns of cases and controls, and to understand the role of p53. Our main goal in supervised learning was to identify SNPs and SNP interactions that are likely to alter the susceptibility to HNSCC. Logic regression, a classification method well suited for SNPs, was employed as well as a Bayesian generalization that allows for incorporating additional expert knowledge. These methods detected several important interactions, such as an association between CYP1B1, tobacco smokes and p53 mutations and some interactions between CYP1B1 and glutathione S-transferases in smokers, which included a three-way interaction between CYP1B1, CYP2E1-70G>T, and GSTP1 (exon 5).

MeSH terms

  • Aryl Hydrocarbon Hydroxylases / genetics
  • Bayes Theorem
  • Carcinoma, Squamous Cell / blood
  • Carcinoma, Squamous Cell / genetics*
  • Case-Control Studies
  • Cluster Analysis
  • Cytochrome P-450 CYP1B1
  • DNA, Neoplasm / blood
  • Data Interpretation, Statistical
  • Female
  • Genes, p53 / genetics
  • Genetic Predisposition to Disease*
  • Glutathione Transferase / genetics
  • Head and Neck Neoplasms / blood
  • Head and Neck Neoplasms / genetics*
  • Humans
  • Logistic Models
  • Male
  • Middle Aged
  • Models, Statistical*
  • Polymorphism, Single Nucleotide
  • Smoking
  • Tobacco Use Disorder / genetics

Substances

  • DNA, Neoplasm
  • Aryl Hydrocarbon Hydroxylases
  • CYP1B1 protein, human
  • Cytochrome P-450 CYP1B1
  • Glutathione Transferase