Progress and challenges in predicting protein-protein interaction sites

Brief Bioinform. 2009 May;10(3):233-46. doi: 10.1093/bib/bbp021. Epub 2009 Apr 3.

Abstract

The identification of protein-protein interaction sites is an essential intermediate step for mutant design and the prediction of protein networks. In recent years a significant number of methods have been developed to predict these interface residues and here we review the current status of the field. Progress in this area requires a clear view of the methodology applied, the data sets used for training and testing the systems, and the evaluation procedures. We have analysed the impact of a representative set of features and algorithms and highlighted the problems inherent in generating reliable protein data sets and in the posterior analysis of the results. Although it is clear that there have been some improvements in methods for predicting interacting sites, several major bottlenecks remain. Proteins in complexes are still under-represented in the structural databases and in particular many proteins involved in transient complexes are still to be crystallized. We provide suggestions for effective feature selection, and make it clear that community standards for testing, training and performance measures are necessary for progress in the field.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Binding Sites
  • Databases, Protein
  • Multiprotein Complexes / chemistry
  • Multiprotein Complexes / metabolism
  • Protein Conformation*
  • Protein Interaction Mapping* / methods
  • Proteins / chemistry*
  • Proteins / genetics
  • Proteins / metabolism*
  • Static Electricity
  • Surface Properties

Substances

  • Multiprotein Complexes
  • Proteins