Disulfide connectivity prediction based on structural information without a prior knowledge of the bonding state of cysteines

Comput Biol Med. 2013 Nov;43(11):1941-8. doi: 10.1016/j.compbiomed.2013.09.008. Epub 2013 Sep 18.

Abstract

Previous studies predicted the disulfide bonding patterns of cysteines using a prior knowledge of their bonding states. In this study, we propose a method that is based on the ensemble support vector machine (SVM), with the structural features of cysteines extracted without any prior knowledge of their bonding states. This method is useful for improving the predictive performance of disulfide bonding patterns. For comparison, the proposed method was tested with the same dataset SPX that was adopted in previous studies. The experimental results demonstrate that bridge classification and disulfide connectivity predictions achieve 96.5% and 89.2% accuracy, respectively, using the ensemble SVM model, which outperforms the traditional method (51.5% and 51.0%, respectively) and the model that is based on a single-kernel SVM classifier (94.6% and 84.4%, respectively). For protein chain and residue classifications, the sensitivity, specificity, and accuracy of ensemble and single-kernel SVM approaches are better than those of the traditional methods. The predictive performances of the ensemble SVM and single-kernel models are identical, indicating that the ensemble model can converge to the single-kernel model for some applications.

Keywords: Disulfide bonding pattern; Ensemble classifier; Multiple trajectory search; Support vector machine.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods*
  • Cysteine / chemistry*
  • Cysteine / metabolism
  • Databases, Protein
  • Disulfides / chemistry*
  • Disulfides / metabolism
  • Proteins / chemistry*
  • Proteins / metabolism
  • Support Vector Machine*

Substances

  • Disulfides
  • Proteins
  • Cysteine