Inferring Epistasis from Genetic Time-series Data

Mol Biol Evol. 2022 Oct 7;39(10):msac199. doi: 10.1093/molbev/msac199.

Abstract

Epistasis refers to fitness or functional effects of mutations that depend on the sequence background in which these mutations arise. Epistasis is prevalent in nature, including populations of viruses, bacteria, and cancers, and can contribute to the evolution of drug resistance and immune escape. However, it is difficult to directly estimate epistatic effects from sampled observations of a population. At present, there are very few methods that can disentangle the effects of selection (including epistasis), mutation, recombination, genetic drift, and genetic linkage in evolving populations. Here we develop a method to infer epistasis, along with the fitness effects of individual mutations, from observed evolutionary histories. Simulations show that we can accurately infer pairwise epistatic interactions provided that there is sufficient genetic diversity in the data. Our method also allows us to identify which fitness parameters can be reliably inferred from a particular data set and which ones are unidentifiable. Our approach therefore allows for the inference of more complex models of selection from time-series genetic data, while also quantifying uncertainty in the inferred parameters.

Keywords: Bayesian inference; diffusion; epistasis; linkage; longitudinal data; path integral; selection; time-series data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Epistasis, Genetic*
  • Genetic Fitness
  • Genetic Linkage
  • Models, Genetic
  • Mutation
  • Selection, Genetic*