Prediction of collagen stability from amino acid sequence

J Biol Chem. 2005 May 13;280(19):19343-9. doi: 10.1074/jbc.M501657200. Epub 2005 Mar 7.

Abstract

An algorithm was derived to relate the amino acid sequence of a collagen triple helix to its thermal stability. This calculation is based on the triple helical stabilization propensities of individual residues and their intermolecular and intramolecular interactions, as quantitated by melting temperature values of host-guest peptides. Experimental melting temperature values of a number of triple helical peptides of varying length and sequence were successfully predicted by this algorithm. However, predicted T(m) values are significantly higher than experimental values when there are strings of oppositely charged residues or concentrations of like charges near the terminus. Application of the algorithm to collagen sequences highlights regions of unusually high or low stability, and these regions often correlate with biologically significant features. The prediction of stability from sequence indicates an understanding of the major forces maintaining this protein motif. The use of highly favorable KGE and KGD sequences is seen to complement the stabilizing effects of imino acids in modulating stability and may become dominant in the collagenous domains of bacterial proteins that lack hydroxyproline. The effect of single amino acid mutations in the X and Y positions can be evaluated with this algorithm. An interactive collagen stability calculator based on this algorithm is available online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Amino Acid Motifs
  • Amino Acids / chemistry
  • Animals
  • Collagen / chemistry*
  • Collagen Type I / chemistry
  • Collagen Type II / chemistry
  • Dimerization
  • Humans
  • Hydroxyproline / chemistry
  • Mutation
  • Peptides / chemistry
  • Protein Conformation
  • Protein Folding
  • Protein Structure, Secondary
  • Protein Structure, Tertiary
  • Proteomics / methods*
  • Static Electricity
  • Temperature

Substances

  • Amino Acids
  • Collagen Type I
  • Collagen Type II
  • Peptides
  • Collagen
  • Hydroxyproline