Boosting classifier for predicting protein domain structural class

Biochem Biophys Res Commun. 2005 Aug 19;334(1):213-7. doi: 10.1016/j.bbrc.2005.06.075.

Abstract

A novel classifier, the so-called "LogitBoost" classifier, was introduced to predict the structural class of a protein domain according to its amino acid sequence. LogitBoost is featured by introducing a log-likelihood loss function to reduce the sensitivity to noise and outliers, as well as by performing classification via combining many weak classifiers together to build up a very strong and robust classifier. It was demonstrated thru jackknife cross-validation tests that LogitBoost outperformed other classifiers including "support vector machine," a very powerful classifier widely used in biological literatures. It is anticipated that LogitBoost can also become a useful vehicle in classifying other attributes of proteins according to their sequences, such as subcellular localization and enzyme family class, among many others.

Publication types

  • Evaluation Study

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Artificial Intelligence*
  • Computer Simulation
  • Models, Chemical*
  • Models, Molecular*
  • Molecular Sequence Data
  • Protein Conformation
  • Protein Structure, Tertiary
  • Proteins / analysis
  • Proteins / chemistry*
  • Proteins / classification*
  • Sequence Alignment / methods
  • Sequence Analysis, Protein / methods*
  • Structure-Activity Relationship

Substances

  • Proteins