Predicting enzymatic function from global binding site descriptors

Proteins. 2013 Mar;81(3):479-89. doi: 10.1002/prot.24205. Epub 2012 Dec 24.

Abstract

Due to the rising number of solved protein structures, computer-based techniques for automatic protein functional annotation and classification into families are of high scientific interest. DoGSiteScorer automatically calculates global descriptors for self-predicted pockets based on the 3D structure of a protein. Protein function predictors on three levels with increasing granularity are built by use of a support vector machine (SVM), based on descriptors of 26632 pockets from enzymes with known structure and enzyme classification. The SVM models represent a generalization of the available descriptor space for each enzyme class, subclass, and substrate-specific sub-subclass. Cross-validation studies show accuracies of 68.2% for predicting the correct main class and accuracies between 62.8% and 80.9% for the six subclasses. Substrate-specific recall rates for a kinase subset are 53.8%. Furthermore, application studies show the ability of the method for predicting the function of unknown proteins and gaining valuable information for the function prediction field.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bacteria / chemistry
  • Bacteria / enzymology
  • Bacterial Proteins / chemistry
  • Catalytic Domain*
  • Computational Biology / methods
  • Databases, Protein
  • Enzyme Activation
  • Ligands
  • Molecular Sequence Annotation
  • Phosphotransferases / chemistry*
  • Phosphotransferases / classification
  • Structure-Activity Relationship
  • Substrate Specificity
  • Support Vector Machine*

Substances

  • Bacterial Proteins
  • Ligands
  • Phosphotransferases