Subfamily evolution analysis using nuclear and chloroplast data from the same reads

Sci Rep. 2025 Jan 3;15(1):687. doi: 10.1038/s41598-024-83292-9.

Abstract

The chloroplast (cp) genome is a widely used tool for exploring plant evolutionary relationships, yet its effectiveness in fully resolving these relationships remains uncertain. Integrating cp genome data with nuclear DNA information offers a more comprehensive view but often requires separate datasets. In response, we employed the same raw read sequencing data to construct cp genome-based trees and nuclear DNA phylogenetic trees using Read2Tree, a cost-efficient method for extracting conserved nuclear gene sequences from raw read data, focusing on the Aurantioideae subfamily, which includes Citrus and its relatives. The resulting nuclear DNA trees were consistent with existing nuclear evolutionary relationships derived from high-throughput sequencing, but diverged from cp genome-based trees. To elucidate the underlying complex evolutionary processes causing these discordances, we implemented an integrative workflow that utilized multiple alignments of each gene generated by Read2Tree, in conjunction with other phylogenomic methods. Our analysis revealed that incomplete lineage sorting predominantly drives these discordances, while introgression and ancient introgression also contribute to topological discrepancies within certain clades. This study underscores the cost-effectiveness of using the same raw sequencing data for both cp and nuclear DNA analyses in understanding plant evolutionary relationships.

Keywords: Aurantioideae subfamily; Chloroplast genome sequences; Incomplete lineage sorting; Introgression; Phylogenetic discordance; Read2Tree.

MeSH terms

  • Cell Nucleus* / genetics
  • Chloroplasts / genetics
  • Citrus / genetics
  • DNA, Chloroplast / genetics
  • Evolution, Molecular*
  • Genome, Chloroplast*
  • High-Throughput Nucleotide Sequencing / methods
  • Phylogeny*
  • Sequence Analysis, DNA / methods

Substances

  • DNA, Chloroplast