BEAN and HABAS: Polyphyletic insertions in the DNA-directed RNA polymerase

Protein Sci. 2024 Nov;33(11):e5194. doi: 10.1002/pro.5194.

Abstract

The β and β' subunits of the RNA polymerase (RNAP) are large proteins with complex multi-domain architectures that include several insertional domains. Here, we analyze the domain organizations of RNAP-β and RNAP-β' using sequence, experimentally determined structures and AlphaFold structure predictions. We observe that lineage-specific insertional domains in bacterial RNAP-β belong to a group that we call BEAN (broadly embedded annex). We observe that lineage-specific insertional domains in bacterial RNAP-β' belong to a group that we call HABAS (hammerhead/barrel-sandwich hybrid). The BEAN domain has a characteristic three-dimensional structure composed of two square bracket-like elements that are antiparallel relative to each other. The HABAS domain contains a four-stranded open β-sheet with a GD-box-like motif in one of the β-strands and the adjoining loop. The BEAN domain is inserted not only in the bacterial RNAP-β', but also in the archaeal version of universal ribosomal protein L10. The HABAS domain is inserted in several metabolic proteins. The phylogenetic distributions of bacterial lineage-specific insertional domains of β and β' subunits of RNAP follow the Tree of Life. The presence of insertional domains can help establish a relative timeline of events in the evolution of a protein because insertion is inferred to post-date the base domain. We discuss mechanisms that might account for the discovery of homologous insertional domains in non-equivalent locations in bacteria and archaea.

Keywords: bacteria; insertional domains; protein evolution; transcription.

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • DNA-Directed RNA Polymerases* / chemistry
  • DNA-Directed RNA Polymerases* / genetics
  • DNA-Directed RNA Polymerases* / metabolism
  • Models, Molecular
  • Phylogeny
  • Protein Domains

Substances

  • DNA-Directed RNA Polymerases
  • Bacterial Proteins
  • RNA polymerase beta subunit