Statistical methods for expression quantitative trait loci (eQTL) mapping

Biometrics. 2006 Mar;62(1):19-27. doi: 10.1111/j.1541-0420.2005.00437.x.

Abstract

Traditional genetic mapping has largely focused on the identification of loci affecting one, or at most a few, complex traits. Microarrays allow for measurement of thousands of gene expression abundances, themselves complex traits, and a number of recent investigations have considered these measurements as phenotypes in mapping studies. Combining traditional quantitative trait loci (QTL) mapping methods with microarray data is a powerful approach with demonstrated utility in a number of recent biological investigations. These expression quantitative trait loci (eQTL) studies are similar to traditional QTL studies, as a main goal is to identify the genomic locations to which the expression traits are linked. However, eQTL studies probe thousands of expression transcripts; and as a result, standard multi-trait QTL mapping methods, designed to handle at most tens of traits, do not directly apply. One possible approach is to use single-trait QTL mapping methods to analyze each transcript separately. This leads to an increased number of false discoveries, as corrections for multiple tests across transcripts are not made. Similarly, the repeated application, at each marker, of methods for identifying differentially expressed transcripts suffers from multiple tests across markers. Here, we demonstrate the deficiencies of these approaches and propose a mixture over markers (MOM) model that shares information across both markers and transcripts. The utility of all methods is evaluated using simulated data as well as data from an F(2) mouse cross in a study of diabetes. Results from simulation studies indicate that the MOM model is best at controlling false discoveries, without sacrificing power. The MOM model is also the only one capable of finding two genome regions previously shown to be involved in diabetes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chromosome Mapping / statistics & numerical data
  • Diabetes Mellitus / genetics
  • False Positive Reactions
  • Genetic Markers
  • Mice
  • Mice, Inbred C57BL
  • Mice, Mutant Strains
  • Models, Statistical*
  • Oligonucleotide Array Sequence Analysis
  • Quantitative Trait Loci*
  • RNA, Messenger / analysis

Substances

  • Genetic Markers
  • RNA, Messenger