Non-parametric genetic prediction of complex traits with latent Dirichlet process regression models

Nat Commun. 2017 Sep 6;8(1):456. doi: 10.1038/s41467-017-00470-2.

Abstract

Using genotype data to perform accurate genetic prediction of complex traits can facilitate genomic selection in animal and plant breeding programs, and can aid in the development of personalized medicine in humans. Because most complex traits have a polygenic architecture, accurate genetic prediction often requires modeling all genetic variants together via polygenic methods. Here, we develop such a polygenic method, which we refer to as the latent Dirichlet process regression model. Dirichlet process regression is non-parametric in nature, relies on the Dirichlet process to flexibly and adaptively model the effect size distribution, and thus enjoys robust prediction performance across a broad spectrum of genetic architectures. We compare Dirichlet process regression with several commonly used prediction methods in simulations. We further apply Dirichlet process regression to predict gene expression, to conduct PrediXcan-based gene set tests, to perform genomic selection of four traits in two species, and to predict eight complex traits in a human cohort.

Genetic prediction of complex traits with polygenic architecture has wide application from animal breeding to disease prevention. Here, Zeng and Zhou develop a non-parametric genetic prediction method based on latent Dirichlet process regression models.
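The abstract states that the method uses a Dirichlet process to model the effect size distribution adaptively. As a rough illustration of that idea only, the sketch below draws variant effect sizes from a truncated stick-breaking mixture of zero-mean normals. This is not the authors' implementation: the truncation level `k`, the concentration `alpha`, the gamma draw for component variances, and all function names are assumptions made for this example.

```python
import random


def stick_breaking_weights(alpha, k, rng):
    """Truncated stick-breaking weights for a Dirichlet process.

    alpha is the concentration parameter; k is the truncation level
    (both are illustrative assumptions, not values from the paper).
    """
    weights = []
    remaining = 1.0
    for _ in range(k - 1):
        v = rng.betavariate(1.0, alpha)  # fraction of the remaining stick
        weights.append(remaining * v)
        remaining *= 1.0 - v
    weights.append(remaining)  # last component takes the leftover stick
    return weights


def sample_effect_sizes(n_variants, alpha=1.0, k=10, seed=0):
    """Draw effect sizes from a mixture of zero-mean normals.

    Component variances are drawn from a gamma distribution purely for
    illustration (hypothetical choice).
    """
    rng = random.Random(seed)
    weights = stick_breaking_weights(alpha, k, rng)
    variances = [rng.gammavariate(1.0, 0.01) for _ in range(k)]
    effects = []
    for _ in range(n_variants):
        # Pick a mixture component, then draw the effect from its normal.
        comp = rng.choices(range(k), weights=weights)[0]
        effects.append(rng.gauss(0.0, variances[comp] ** 0.5))
    return weights, variances, effects
```

Because the mixture weights are themselves random, the resulting effect size distribution can range from a few large effects to many small ones, which is the kind of flexibility across genetic architectures that the abstract attributes to the method.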

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Bayes Theorem
  • Cattle
  • Computational Biology / methods*
  • Computer Simulation*
  • Genetic Variation
  • Genomics
  • Genotype
  • Humans
  • Models, Genetic*
  • Multifactorial Inheritance
  • Phenotype
  • Regression Analysis
  • Selection, Genetic
  • Software
  • Species Specificity
  • Zea mays / genetics