Open-source benchmarking of IBD segment detection methods for biobank-scale cohorts

Gigascience. 2022 Dec 6;11:giac111. doi: 10.1093/gigascience/giac111.

Abstract

In the recent biobank era of genetics, the problem of identical-by-descent (IBD) segment detection has received renewed interest, as IBD segments in large cohorts offer unprecedented opportunities for studying population and genealogical history, as well as genetic association of long haplotypes. Although a new generation of efficient methods for IBD segment detection has become available, direct comparison of these methods is difficult: existing benchmarks were often evaluated on different datasets, some of which are not openly accessible; benchmarked methods were run under suboptimal parameters; and benchmark performance metrics were not defined consistently. Here, we developed a comprehensive and completely open-source evaluation of the power, accuracy, and resource consumption of these IBD segment detection methods using realistic population genetic simulations with various settings. Our results pave the way for fair evaluation of IBD segment detection methods and provide a practical guide for users.

Keywords: IBD segment detection tools; benchmarking; biobank-scale data; identical-by-descent.

MeSH terms

  • Biological Specimen Banks*
  • Humans