Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data

Am J Hum Genet. 2017 Sep 7;101(3):340-352. doi: 10.1016/j.ajhg.2017.07.011. Epub 2017 Aug 24.

Abstract

Substantial progress has been made in the functional annotation of genetic variation in the human genome. Integrative analysis that incorporates such functional annotations into sequencing studies can aid the discovery of disease-associated genetic variants, especially those with unknown function and located outside protein-coding regions. Direct incorporation of one functional annotation as weight in existing dispersion and burden tests can suffer substantial loss of power when the functional annotation is not predictive of the risk status of a variant. Here, we have developed unified tests that can utilize multiple functional annotations simultaneously for integrative association analysis with efficient computational techniques. We show that the proposed tests significantly improve power when variant risk status can be predicted by functional annotations. Importantly, when functional annotations are not predictive of risk status, the proposed tests incur only minimal loss of power in relation to existing dispersion and burden tests, and under certain circumstances they can even have improved power by learning a weight that better approximates the underlying disease model in a data-adaptive manner. The tests can be constructed with summary statistics of existing dispersion and burden tests for sequencing data, therefore allowing meta-analysis of multiple studies without sharing individual-level data. We applied the proposed tests to a meta-analysis of noncoding rare variants in Metabochip data on 12,281 individuals from eight studies for lipid traits. By incorporating the Eigen functional score, we detected significant associations between noncoding rare variants in SLC22A3 and low-density lipoprotein and total cholesterol, associations that are missed by standard dispersion and burden tests.

Keywords: Metabochip data; functional annotations; integrative methods; meta-anlysis; sequence-based association tests.

MeSH terms

  • Cardiovascular Diseases / genetics
  • Cardiovascular Diseases / metabolism
  • Cardiovascular Diseases / pathology
  • Computer Simulation
  • Genetic Predisposition to Disease
  • Genome, Human*
  • Genome-Wide Association Study / methods*
  • Humans
  • Lipids / analysis
  • Metabolomics*
  • Molecular Sequence Annotation / methods*
  • Organic Cation Transport Proteins / genetics
  • Phenotype
  • Polymorphism, Single Nucleotide*

Substances

  • Lipids
  • Organic Cation Transport Proteins
  • solute carrier family 22 (organic cation transporter), member 3