Impacts of population structure and analytical models in genome-wide association studies of complex traits in forest trees: a case study in Eucalyptus globulus

PLoS One. 2013 Nov 25;8(11):e81267. doi: 10.1371/journal.pone.0081267. eCollection 2013.

Abstract

The promise of association genetics to identify genes or genomic regions controlling complex traits has generated a flurry of interest. Such phenotype-genotype associations could be useful to accelerate tree breeding cycles, increase precision and selection intensity for late expressing, low heritability traits. However, the prospects of association genetics in highly heterozygous undomesticated forest trees can be severely impacted by the presence of cryptic population and pedigree structure. To investigate how to better account for this, we compared the GLM and five combinations of the Unified Mixed Model ( UMM ) on data of a low-density genome-wide association study for growth and wood property traits carried out in a Eucalyptus globulus population (n = 303) with 7,680 Diversity Array Technology (DArT) markers. Model comparisons were based on the degree of deviation from the uniform distribution and estimates of the mean square differences between the observed and expected p-values of all significant marker-trait associations detected. Our analysis revealed the presence of population and family structure. There was not a single best model for all traits. Striking differences in detection power and accuracy were observed among the different models especially when population structure was not accounted for. The UMM method was the best and produced superior results when compared to GLM for all traits. Following stringent correction for false discoveries, 18 marker-trait associations were detected, 16 for tree diameter growth and two for lignin monomer composition (S:G ratio), a key wood property trait. The two DArT markers associated with S:G ratio on chromosome 10, physically map within 1 Mbp of the ferulate 5-hydroxylase (F5H) gene, providing a putative independent validation of this marker-trait association. This study details the merit of collectively integrate population structure and relatedness in association analyses in undomesticated, highly heterozygous forest trees, and provides additional insights into the nature of complex quantitative traits in Eucalyptus.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Eucalyptus / genetics*
  • Genes, Plant
  • Genetic Markers
  • Genome-Wide Association Study*
  • Models, Theoretical*
  • Principal Component Analysis
  • Quantitative Trait Loci

Substances

  • Genetic Markers

Grants and funding

This research was supported by grant from BiotecSur UE 127118 project to SNMP and DG, Agencia Nacional de Promoción Científica y Tecnológica (PICT 00118 and PICT 00321) of Argentina to SNMP and EPC, and CNPq CBAB (Brazil-Argentina Biotechnology Center) grant 577047/2008-6 to DG that supported the DArT marker array development. PICT 00321 is below the Programa de Modernización Tecnológica III, Contrato de Préstamo BID 1728/OC-AR. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.