Beyond RuBisCO: convergent molecular evolution of multiple chloroplast genes in C4 plants

PeerJ. 2022 Jan 27:10:e12791. doi: 10.7717/peerj.12791. eCollection 2022.

Abstract

Background: The recurrent evolution of the C4 photosynthetic pathway in angiosperms represents one of the most extraordinary examples of convergent evolution of a complex trait. Comparative genomic analyses have unveiled some of the molecular changes associated with the C4 pathway. For instance, several key enzymes involved in the transition from C3 to C4 photosynthesis have been found to share convergent amino acid replacements along C4 lineages. However, the extent of convergent replacements potentially associated with the emergence of C4 plants remains to be fully assessed. Here, we conducted an organelle-wide analysis to determine if convergent evolution occurred in multiple chloroplast proteins beside the well-known case of the large RuBisCO subunit encoded by the chloroplast gene rbcL.

Methods: Our study was based on the comparative analysis of 43 C4 and 21 C3 grass species belonging to the PACMAD clade, a focal taxonomic group in many investigations of C4 evolution. We first used protein sequences of 67 orthologous chloroplast genes to build an accurate phylogeny of these species. Then, we inferred amino acid replacements along 13 C4 lineages and 9 C3 lineages using reconstructed protein sequences of their reference branches, corresponding to the branches containing the most recent common ancestors of C4-only clades and C3-only clades. Pairwise comparisons between reference branches allowed us to identify both convergent and non-convergent amino acid replacements between C4:C4, C3:C3 and C3:C4 lineages.

Results: The reconstructed phylogenetic tree of 64 PACMAD grasses was characterized by strong supports in all nodes used for analyses of convergence. We identified 217 convergent replacements and 201 non-convergent replacements in 45/67 chloroplast proteins in both C4 and C3 reference branches. C4:C4 branches showed higher levels of convergent replacements than C3:C3 and C3:C4 branches. Furthermore, we found that more proteins shared unique convergent replacements in C4 lineages, with both RbcL and RpoC1 (the RNA polymerase beta' subunit 1) showing a significantly higher convergent/non-convergent replacements ratio in C4 branches. Notably, more C4:C4 reference branches showed higher numbers of convergent vs. non-convergent replacements than C3:C3 and C3:C4 branches. Our results suggest that, in the PACMAD clade, C4 grasses experienced higher levels of molecular convergence than C3 species across multiple chloroplast genes. These findings have important implications for our understanding of the evolution of the C4 photosynthesis pathway.

Keywords: C3 photosynthesis; C4 photosynthesis; Convergent evolution; Grasses; Molecular evolution; PACMAD; Poaceae.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Chloroplast Proteins / genetics
  • Evolution, Molecular
  • Genes, Chloroplast*
  • Phylogeny
  • Plants / genetics
  • Poaceae
  • Ribulose-Bisphosphate Carboxylase* / genetics

Substances

  • Ribulose-Bisphosphate Carboxylase
  • Chloroplast Proteins

Associated data

  • figshare/10.6084/m9.figshare.15180690

Grants and funding

The project was supported by the National Institute of Food and Agriculture, U.S. Department of Agriculture, under award number TEX0-1-9599, the Texas A&M AgriLife Research, and the Texas A&M Forest Service. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.