G2GSnake: a Snakemake workflow for host-pathogen genomic association studies

Bioinform Adv. 2023 Oct 4;3(1):vbad142. doi: 10.1093/bioadv/vbad142. eCollection 2023.

Abstract

Summary: Joint analyses of paired host and pathogen genome sequences have the potential to enhance our understanding of host-pathogen interactions. A systematic approach to conduct such a joint analysis is through a "genome-to-genome" (G2G) association study, which involves testing for associations between all host and pathogen genetic variants. Significant associations reveal host genetic factors that might drive pathogen variation, highlighting biological mechanisms likely to be involved in host control and pathogen escape. Here, we present a Snakemake workflow that allows researchers to conduct G2G studies in a reproducible and scalable manner. In addition, we have developed an intuitive R Shiny application that generates custom summaries of the results, enabling users to derive relevant insights.

Availability and implementation: G2GSnake is freely available at: https://github.com/zmx21/G2GSnake under the MIT license.