The Repeating, Modular Architecture of the HtrA Proteases

Biomolecules. 2022 Jun 7;12(6):793. doi: 10.3390/biom12060793.

Abstract

A conserved, 26-residue sequence [AA(X2)[A/G][G/L](X2)GDV[I/L](X2)[V/L]NGE(X1)V(X6)] and corresponding structure repeating module were identified within the HtrA protease family using a non-redundant set (N = 20) of publicly available structures. While the repeats themselves were far from sequence perfect, they had notable conservation to a statistically significant level. Three or more repetitions were identified within each protein despite being statistically expected to randomly occur only once per 1031 residues. This sequence repeat was associated with a six stranded antiparallel β-barrel module, two of which are present in the core of the structures of the PA clan of serine proteases, while a modified version of this module could be identified in the PDZ-like domains. Automated structural alignment methods had difficulties in superimposing these β-barrels, but the use of a target human HtrA2 structure showed that these modules had an average RMSD across the set of structures of less than 2 Å (mean and median). Our findings support Dayhoff's hypothesis that complex proteins arose through duplication of simpler peptide motifs and domains.

Keywords: HtrA protease; PA clan; protein evolution; protein repeat; serine protease.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Peptides / chemistry
  • Serine Endopeptidases* / metabolism
  • Serine Proteases* / chemistry
  • Serine Proteases* / genetics

Substances

  • Peptides
  • Serine Proteases
  • Serine Endopeptidases

Grants and funding

This work was supported by the National Science Centre, Poland, grant #2014/15/D/NZ1/00968 to M.W.G and by the European Union’s Horizon 2020 research and innovation programme under GA 952334 (PhasAGE) to S.M.-R.