Analysis of gene duplication within the Arabidopsis NUCLEAR FACTOR Y, subunit B (NF-YB) protein family reveals domains under both purifying and diversifying selection

PLoS One. 2023 Aug 2;18(8):e0289332. doi: 10.1371/journal.pone.0289332. eCollection 2023.

Abstract

Gene duplication is an evolutionary mechanism that provides new genetic material. Since gene duplication is a major driver for molecular evolution, examining the fate of duplicated genes is an area of active research. The fate of duplicated genes can include loss, subfunctionalization, and neofunctionalization. In this manuscript, we chose to experimentally study the fate of duplicated genes using the Arabidopsis NUCLEAR FACTOR Y (NF-Y) transcription factor family. NF-Y transcription factors are heterotrimeric complexes, composed of NF-YA, NF-YB, and NF-YC. NF-YA subunits are responsible for nucleotide-specific binding to a CCAAT cis-regulatory element. NF-YB and NF-YC subunits make less specific, but essential complex-stabilizing contacts with the DNA flanking the core CCAAT pentamer. While ubiquitous in eukaryotes, each NF-Y family has expanded by duplication in the plant lineage. For example, the model plant Arabidopsis contains 10 each of the NF-Y subunits. Here we examine the fate of duplicated NF-YB proteins in Arabidopsis, which are composed of central histone fold domains (HFD) and less conserved flanking regions (N- and C-termini). Specifically, the principal question we wished to address in this manuscript was to what extent can the 10 Arabidopsis NF-YB paralogs functionally substitute the genes NF-YB2 and NF-YB3 in the promotion of photoperiodic flowering? Our results demonstrate that the conserved histone fold domains (HFD) may be under pressure for purifying (negative) selection, while the non-conserved N- and C-termini may be under pressure for diversifying (positive) selection, which explained each paralog's ability to substitute. In conclusion, our data demonstrate that the N- and C-termini may have allowed the duplicated genes to undergo functional diversification, allowing the retention of the duplicated genes.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Arabidopsis Proteins* / genetics
  • Arabidopsis Proteins* / metabolism
  • Arabidopsis* / genetics
  • Arabidopsis* / metabolism
  • CCAAT-Binding Factor / genetics
  • CCAAT-Binding Factor / metabolism
  • Gene Duplication
  • Histones / metabolism
  • Transcription Factors / genetics

Substances

  • nuclear factor Y
  • Histones
  • CCAAT-Binding Factor
  • Transcription Factors
  • NF-YB2 protein, Arabidopsis
  • Arabidopsis Proteins

Grants and funding

Funding support for this project came National Science Foundation (US, http://www.nsf.gov/) award 1149822 to BFH and the Texas A&M University-Central Texas, College of Arts and Science, FSRC grant to CLS. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.