A genome-wide linkage and association analysis of imputed insertions and deletions with cardiometabolic phenotypes in Mexican Americans: The Insulin Resistance Atherosclerosis Family Study

Genet Epidemiol. 2017 May;41(4):353-362. doi: 10.1002/gepi.22042. Epub 2017 Apr 5.

Abstract

Insertions and deletions (INDELs) represent a significant fraction of interindividual variation in the human genome yet their contribution to phenotypes is poorly understood. To confirm the quality of imputed INDELs and investigate their roles in mediating cardiometabolic phenotypes, genome-wide association and linkage analyses were performed for 15 phenotypes with 1,273,952 imputed INDELs in 1,024 Mexican-origin Americans. Imputation quality was validated using whole exome sequencing with an average kappa of 0.93 in common INDELs (minor allele frequencies [MAFs] ≥ 5%). Association analysis revealed one genome-wide significant association signal for the cholesterylester transfer protein gene (CETP) with high-density lipoprotein levels (rs36229491, P = 3.06 × 10-12 ); linkage analysis identified two peaks with logarithm of the odds (LOD) > 5 (rs60560566, LOD = 5.36 with insulin sensitivity (SI ) and rs5825825, LOD = 5.11 with adiponectin levels). Suggestive overlapping signals between linkage and association were observed: rs59849892 in the WSC domain containing 2 gene (WSCD2) was associated and nominally linked with SI (P = 1.17 × 10-7 , LOD = 1.99). This gene has been implicated in glucose metabolism in human islet cell expression studies. In addition, rs201606363 was linked and nominally associated with low-density lipoprotein (P = 4.73 × 10-4 , LOD = 3.67), apolipoprotein B (P = 1.39 × 10-3 , LOD = 4.64), and total cholesterol (P = 1.35 × 10-2 , LOD = 3.80) levels. rs201606363 is an intronic variant of the UBE2F-SCLY (where UBE2F is ubiquitin-conjugating enzyme E2F and SCLY is selenocysteine lyase) fusion gene that may regulate cholesterol through selenium metabolism. In conclusion, these results confirm the feasibility of imputing INDELs from array-based single nucleotide polymorphism (SNP) genotypes. Analysis of these variants using association and linkage replicated previously identified SNP signals and identified multiple novel INDEL signals. These results support the inclusion of INDELs into genetic studies to more fully interrogate the spectrum of genetic variation.

Keywords: cardiometabolic disease; genome-wide association analysis; imputation; insertion/deletions; linkage analysis.

MeSH terms

  • Adult
  • Atherosclerosis / genetics*
  • Demography
  • Family
  • Female
  • Gene Frequency / genetics
  • Genetic Linkage*
  • Genome, Human
  • Genome-Wide Association Study*
  • Humans
  • INDEL Mutation / genetics*
  • Insulin Resistance / genetics*
  • Male
  • Mexican Americans / genetics*
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics