Predicting protein interaction interfaces from protein sequences: case studies of subtilisin and phycocyanin

Proteins. 2008 May 15;71(3):1461-74. doi: 10.1002/prot.21836.

Abstract

Identification of protein interaction interfaces is very important for understanding the molecular mechanisms underlying biological phenomena. Here, we present a novel method for predicting protein interaction interfaces from sequences by using PAM matrix (PIFPAM). Sequence alignments for interacting proteins were constructed and parsed into segments using sliding windows. By calculating distance matrix for each segment, the correlation coefficients between segments were estimated. The interaction interfaces were predicted by extracting highly correlated segment pairs from the correlation map. The predictions achieved an accuracy 0.41-0.71 for eight intraprotein interaction examples, and 0.07-0.60 for four interprotein interaction examples. Compared with three previously published methods, PIFPAM predicted more contacting site pairs for 11 out of the 12 example proteins, and predicted at least 34% more contacting site pairs for eight proteins of them. The factors affecting the predictions were also analyzed. Since PIFPAM uses only the alignments of the two interacting proteins as input, it is especially useful when no three-dimensional protein structure data are available.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Ligands
  • Phycocyanin / chemistry
  • Phycocyanin / metabolism*
  • Predictive Value of Tests
  • Protein Binding
  • Protein Interaction Mapping
  • Sequence Alignment
  • Sequence Analysis, Protein / methods
  • Subtilisin / chemistry
  • Subtilisin / metabolism*
  • Thermodynamics

Substances

  • Ligands
  • Phycocyanin
  • Subtilisin