Protein meta-functional signatures from combining sequence, structure, evolution, and amino acid property information

PLoS Comput Biol. 2008 Sep 26;4(9):e1000181. doi: 10.1371/journal.pcbi.1000181.

Abstract

Protein function is mediated by the amino acid residues of a protein sequence, through both their positions and their types. Some residues are responsible for the stability or overall shape of the protein and thus play an indirect role in its function. Others are functionally important as part of active or binding sites of the protein. For a given protein sequence, the residues and their degrees of functional importance can be thought of as a signature representing the function of the protein. We have developed a combination of knowledge- and biophysics-based function prediction approaches to elucidate the relationships between the structural and functional roles of individual residues and positions. Such a meta-functional signature (MFS), which is a collection of continuous values representing the functional significance of each residue in a protein, may be used to study proteins of known function in greater detail and to aid in the experimental characterization of proteins of unknown function. We demonstrate the superior performance of MFS in predicting protein functional sites and present four real-world examples that apply MFS in a wide range of settings to elucidate protein sequence-structure-function relationships. Our results indicate that the MFS approach, which can combine multiple sources of information and give a biological interpretation to each component, greatly facilitates the understanding and characterization of protein function.
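
To make the idea of a per-residue signature concrete, the following is a minimal sketch of how several sources of information might be merged into one continuous value per residue. It is not the authors' implementation: the abstract does not specify the combination rule, so the z-score normalization, the logistic link, the component names ("conservation", "structure"), the weights, and the toy scores are all assumptions made for illustration.

```python
import math
from statistics import mean, pstdev


def zscores(values):
    """Standardize one component so that components are on a comparable scale."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def meta_signature(components, weights, bias=0.0):
    """Combine per-residue component scores into one value in (0, 1) per residue.

    components: dict mapping a (hypothetical) component name to a list of
                per-residue scores, one entry per sequence position.
    weights:    dict mapping the same names to illustrative weights; in a real
                setting these would be fitted, for example by regression.
    """
    names = list(components)
    lengths = {len(components[name]) for name in names}
    if len(lengths) != 1:
        raise ValueError("all components must score the same residues")
    n_residues = lengths.pop()

    normalized = {name: zscores(components[name]) for name in names}

    signature = []
    for i in range(n_residues):
        # Weighted linear combination of the normalized component scores.
        linear = bias + sum(weights[name] * normalized[name][i] for name in names)
        # Logistic link maps the combination to a continuous value in (0, 1).
        signature.append(1.0 / (1.0 + math.exp(-linear)))
    return signature


# Toy input: a six-residue sequence with made-up component scores.
sequence = "MKHDSG"
components = {
    "conservation": [0.1, 0.4, 0.9, 0.8, 0.2, 0.3],
    "structure": [0.2, 0.3, 0.7, 0.9, 0.1, 0.2],
}
weights = {"conservation": 1.0, "structure": 1.0}

signature = meta_signature(components, weights)

# Rank residues (1-based positions) from most to least functionally significant.
ranked = sorted(
    zip(range(1, len(sequence) + 1), sequence, signature),
    key=lambda item: item[2],
    reverse=True,
)
for position, residue, score in ranked:
    print(f"{residue}{position}\t{score:.3f}")
```

Because each component keeps its own named weight in this sketch, the contribution of each information source to a residue's score stays inspectable, which loosely mirrors the abstract's point that each component can be given a biological interpretation.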

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Amino Acids / chemistry
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics
  • Bacterial Proteins / physiology
  • Binding Sites
  • Cellulose 1,4-beta-Cellobiosidase / chemistry
  • Cellulose 1,4-beta-Cellobiosidase / genetics
  • Cellulose 1,4-beta-Cellobiosidase / physiology
  • Computational Biology / methods*
  • Computer Simulation
  • Conserved Sequence
  • Databases, Protein / statistics & numerical data
  • Evolution, Molecular
  • Internet
  • Models, Chemical
  • Models, Genetic
  • Models, Molecular*
  • Molecular Structure
  • Mutagenesis, Site-Directed
  • Ornithine Decarboxylase / chemistry
  • Ornithine Decarboxylase / genetics
  • Ornithine Decarboxylase / physiology
  • Protein Interaction Domains and Motifs
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Proteins / genetics*
  • Proteins / physiology
  • Regression Analysis
  • Sequence Alignment / statistics & numerical data
  • Thermodynamics

Substances

  • Amino Acids
  • Bacterial Proteins
  • Proteins
  • Cellulose 1,4-beta-Cellobiosidase
  • Ornithine Decarboxylase