The potential of genome-wide RAD sequences for resolving rapid radiations: a case study in Cactaceae

Mol Phylogenet Evol. 2020 Oct:151:106896. doi: 10.1016/j.ympev.2020.106896. Epub 2020 Jun 18.

Abstract

The reconstruction of relationships within recently radiated groups is challenging even when massive amounts of sequencing data are available. The use of restriction site-associated DNA sequencing (RAD-Seq) to this end is promising. Here, we assessed the performance of RAD-Seq to infer the species-level phylogeny of the rapidly radiating genus Cereus (Cactaceae). To examine how the amount of genomic data affects resolution in this group, we used datasets and implemented different analyses. We sampled 52 individuals of Cereus, representing 18 of the 25 species currently recognized, plus members of the closely allied genera Cipocereus and Praecereus, and other 11 Cactaceae genera as outgroups. Three scenarios of permissiveness to missing data were carried out in iPyRAD, assembling datasets with 30% (333 loci), 45% (1440 loci), and 70% (6141 loci) of missing data. For each dataset, Maximum Likelihood (ML) trees were generated using two supermatrices, i.e., only SNPs and SNPs plus invariant sites. Accuracy and resolution were improved when the dataset with the highest number of loci was used (6141 loci), despite the high percentage of missing data included (70%). Coalescent trees estimated using SVDQuartets and ASTRAL are similar to those obtained by the ML reconstructions. Overall, we reconstruct a well-supported phylogeny of Cereus, which is resolved as monophyletic and composed of four main clades with high support in their internal relationships. Our findings also provide insights into the impact of missing data for phylogeny reconstruction using RAD loci.

Keywords: Cereus; Missing data; Phylogenomics; Radiation; ddRAD-Seq.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Biological Evolution*
  • Cactaceae / genetics*
  • Databases, Genetic
  • Genetic Loci
  • Genetic Speciation
  • Genome, Plant*
  • Likelihood Functions
  • Phylogeny
  • Polymorphism, Single Nucleotide / genetics
  • Principal Component Analysis
  • Sequence Analysis, DNA*

Associated data

  • figshare/10.6084/m9.figshare.12678551.v1