Progress and challenges in predicting protein-protein interaction sites

Iakes Ezkurdia; Lisa Bartoli; Piero Fariselli; Rita Casadio; Alfonso Valencia; Michael L Tress

doi:10.1093/bib/bbp021

Progress and challenges in predicting protein-protein interaction sites

Brief Bioinform. 2009 May;10(3):233-46. doi: 10.1093/bib/bbp021. Epub 2009 Apr 3.

Authors

Iakes Ezkurdia¹, Lisa Bartoli, Piero Fariselli, Rita Casadio, Alfonso Valencia, Michael L Tress

Affiliation

¹ Centro Nacional de Biotechnolgia, Spanish National Cancer Research Centre (CNIO), Madrid, Spain.

PMID: 19346321
DOI: 10.1093/bib/bbp021

Abstract

The identification of protein-protein interaction sites is an essential intermediate step for mutant design and the prediction of protein networks. In recent years a significant number of methods have been developed to predict these interface residues and here we review the current status of the field. Progress in this area requires a clear view of the methodology applied, the data sets used for training and testing the systems, and the evaluation procedures. We have analysed the impact of a representative set of features and algorithms and highlighted the problems inherent in generating reliable protein data sets and in the posterior analysis of the results. Although it is clear that there have been some improvements in methods for predicting interacting sites, several major bottlenecks remain. Proteins in complexes are still under-represented in the structural databases and in particular many proteins involved in transient complexes are still to be crystallized. We provide suggestions for effective feature selection, and make it clear that community standards for testing, training and performance measures are necessary for progress in the field.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Binding Sites
Databases, Protein
Multiprotein Complexes / chemistry
Multiprotein Complexes / metabolism
Protein Conformation*
Protein Interaction Mapping* / methods
Proteins / chemistry*
Proteins / genetics
Proteins / metabolism*
Static Electricity
Surface Properties

Substances

Multiprotein Complexes
Proteins