The promise of automated machine learning for the genetic analysis of complex traits

Hum Genet. 2022 Sep;141(9):1529-1544. doi: 10.1007/s00439-021-02393-x. Epub 2021 Oct 28.

Abstract

The genetic analysis of complex traits has been dominated by parametric statistical methods due to their theoretical properties, ease of use, computational efficiency, and intuitive interpretation. However, there are likely to be patterns arising from complex genetic architectures which are more easily detected and modeled using machine learning methods. Unfortunately, selecting the right machine learning algorithm and tuning its hyperparameters can be daunting for experts and non-experts alike. The goal of automated machine learning (AutoML) is to let a computer algorithm identify the right algorithms and hyperparameters thus taking the guesswork out of the optimization process. We review the promises and challenges of AutoML for the genetic analysis of complex traits and give an overview of several approaches and some example applications to omics data. It is our hope that this review will motivate studies to develop and evaluate novel AutoML methods and software in the genetics and genomics space. The promise of AutoML is to enable anyone, regardless of training or expertise, to apply machine learning as part of their genetic analysis strategy.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Genomics / methods
  • Humans
  • Machine Learning*
  • Multifactorial Inheritance*
  • Software