Tree2GD: a phylogenomic method to detect large-scale gene duplication events

Bioinformatics. 2022 Nov 30;38(23):5317-5321. doi: 10.1093/bioinformatics/btac669.

Abstract

Motivation: Whole-genome duplication events have long been discovered throughout the evolution of eukaryotes, contributing to genome complexity and biodiversity and leaving traces in the descending organisms. Therefore, an accurate and rapid phylogenomic method is needed to identify the retained duplicated genes on various lineages across the target taxonomy.

Results: Here, we present Tree2GD, an integrated method to identify large-scale gene duplication events by automatically perform multiple procedures, including sequence alignment, recognition of homolog, gene tree/species tree reconciliation, Ks distribution of gene duplicates and synteny analyses. Application of Tree2GD on 2 datasets, 12 metazoan genomes and 68 angiosperms, successfully identifies all reported whole-genome duplication events exhibited by these species, showing effectiveness and efficiency of Tree2GD on phylogenomic analyses of large-scale gene duplications.

Availability and implementation: Tree2GD is written in Python and C++ and is available at https://github.com/Dee-chen/Tree2gd.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Eukaryota*
  • Gene Duplication*
  • Phylogeny
  • Sequence Alignment
  • Synteny