CTKPred: an SVM-based method for the prediction and classification of the cytokine superfamily

Protein Eng Des Sel. 2005 Aug;18(8):365-8. doi: 10.1093/protein/gzi041. Epub 2005 Jun 24.

Abstract

Cell proliferation, differentiation and death are controlled by a multitude of cell-cell signals and loss of this control has devastating consequences. Prominent among these regulatory signals is the cytokine superfamily, which has crucial functions in the development, differentiation and regulation of immune cells. In this study, a support vector machine (SVM)-based method was developed for predicting families and subfamilies of cytokines using dipeptide composition. The taxonomy of the cytokine superfamily with which our method complies was described in the Cytokine Family cDNA Database (dbCFC) and the dataset used in this study for training and testing was obtained from the dbCFC and Structural Classification of Proteins (SCOP). The method classified cytokines and non-cytokines with an accuracy of 92.5% by 7-fold cross-validation. The method is further able to predict seven major classes of cytokine with an overall accuracy of 94.7%. A server for recognition and classification of cytokines based on multi-class SVMs has been set up at http://bioinfo.tsinghua.edu.cn/~huangni/CTKPred/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence
  • Computational Biology
  • Cytokines / chemistry*
  • Cytokines / classification
  • Databases, Protein
  • Dipeptides / chemistry*
  • Internet*
  • Sequence Analysis, Protein
  • Software

Substances

  • Cytokines
  • Dipeptides