Finding biologically relevant protein domain interactions: conserved binding mode analysis

Protein Sci. 2006 Feb;15(2):352-61. doi: 10.1110/ps.051760806. Epub 2005 Dec 29.

Abstract

Proteins evolved through the shuffling of functional domains, and therefore, the same domain can be found in different proteins and species. Interactions between such conserved domains often involve specific, well-determined binding surfaces reflecting their important biological role in a cell. To find biologically relevant interactions we developed a method of systematically comparing and classifying protein domain interactions from the structural data. As a result, a set of conserved binding modes (CBMs) was created using the atomic detail of structure alignment data and the protein domain classification of the Conserved Domain Database. A conserved binding mode is inferred when different members of interacting domain families dock in the same way, such that their structural complexes superimpose well. Such domain interactions with recurring structural themes have greater significance to be biologically relevant, unlike spurious crystal packing interactions. Consequently, this study gives lower and upper bounds on the number of different types of interacting domain pairs in the structure database on the order of 1000-2000. We use CBMs to create domain interaction networks, which highlight functionally significant connections by avoiding many infrequent links between highly connected nodes. The CBMs also constitute a library of docking templates that may be used in molecular modeling to infer the characteristics of an unknown binding surface, just as conserved domains may be used to infer the structure of an unknown protein. The method's ability to sort through and classify large numbers of putative interacting domain pairs is demonstrated on the oligomeric interactions of globins.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Algorithms*
  • Binding Sites
  • Computer Simulation
  • Conserved Sequence
  • Databases, Protein*
  • Globins / chemistry*
  • Globins / classification
  • Globins / metabolism*
  • Models, Chemical*
  • Models, Molecular
  • Phylogeny
  • Protein Binding
  • Protein Interaction Mapping*
  • Protein Structure, Tertiary

Substances

  • Globins