A two-stage classifier for identification of protein-protein interface residues

Bioinformatics. 2004 Aug 4:20 Suppl 1:i371-8. doi: 10.1093/bioinformatics/bth920.

Abstract

Motivation: The ability to identify protein-protein interaction sites and to detect specific amino acid residues that contribute to the specificity and affinity of protein interactions has important implications for problems ranging from rational drug design to analysis of metabolic and signal transduction networks.

Results: We have developed a two-stage method consisting of a support vector machine (SVM) and a Bayesian classifier for predicting surface residues of a protein that participate in protein-protein interactions. This approach exploits the fact that interface residues tend to form clusters in the primary amino acid sequence. Our results show that the proposed two-stage classifier outperforms previously published sequence-based methods for predicting interface residues. We also present results obtained using the two-stage classifier on an independent test set of seven CAPRI (Critical Assessment of PRedicted Interactions) targets. The success of the predictions is validated by examining the predictions in the context of the three-dimensional structures of protein complexes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Amino Acids / chemistry*
  • Binding Sites
  • Computer Simulation
  • Models, Chemical*
  • Molecular Sequence Data
  • Protein Binding
  • Protein Interaction Mapping / methods*
  • Proteins / chemistry*
  • Sequence Analysis, Protein / methods*

Substances

  • Amino Acids
  • Proteins