Quality control, imputation and analysis of genome-wide genotyping data from the Illumina HumanCoreExome microarray

Brief Funct Genomics. 2016 Jul;15(4):298-304. doi: 10.1093/bfgp/elv037. Epub 2015 Oct 5.

Abstract

The decreasing cost of performing genome-wide association studies has made genomics widely accessible. However, there is a paucity of guidance on best practice in conducting such analyses. For the results of a study to be valid and replicable, multiple biases must be addressed in the course of data preparation and analysis. In addition, standardizing methods across small, independent studies would increase comparability and the potential for effective meta-analysis. This article provides a discussion of important aspects of quality control, imputation and analysis of genome-wide data from a low-coverage microarray, as well as a straightforward guide to performing a genome-wide association study. A detailed protocol is provided online, with example scripts available at https://github.com/JoniColeman/gwas_scripts.

Keywords: GWAS; analysis; imputation; low-coverage microarray; methods.

MeSH terms

  • Algorithms
  • Cognition Disorders / genetics*
  • Cognition Disorders / therapy
  • Cognitive Behavioral Therapy*
  • Computational Biology / methods*
  • Exome*
  • Genome, Human*
  • Genome-Wide Association Study / methods*
  • Genotype
  • Humans
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Quality Control*
  • Software