Studies of the macroevolutionary legacy of polyploidy are limited by an incomplete sampling of these events across the tree of life. To better locate and understand these events, we need comprehensive taxonomic sampling as well as homology inference methods that accurately reconstruct the frequency and location of gene duplications. We assembled a data set of transcriptomes and genomes from 168 species in Caryophyllales, of which 43 transcriptomes were newly generated for this study, representing one of the most densely sampled genomic-scale data sets available. We carried out phylogenomic analyses using a modified phylome strategy to reconstruct the species tree. We mapped the phylogenetic distribution of polyploidy events by both tree-based and distance-based methods, and explicitly tested scenarios for allopolyploidy. We identified 26 ancient and more recent polyploidy events distributed throughout Caryophyllales. Two of these events were inferred to be allopolyploidy. Through dense phylogenomic sampling, we show the propensity of polyploidy throughout the evolutionary history of Caryophyllales. We also provide a framework for utilizing transcriptome data to detect allopolyploidy, which is important as it may have different macroevolutionary implications compared with autopolyploidy.
Keywords: Caryophyllales; Ks plot; allopolyploidy; genome duplication; modified phylome; polyploidy.
© 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.