Protein interaction detection in sentences via Gaussian processes: a preliminary evaluation

Int J Data Min Bioinform. 2011;5(1):52-72. doi: 10.1504/ijdmb.2011.038577.

Abstract

The non-parametric deterministic Support Vector Machines (SVMs) produce high levels of performances in text classification. This article offers a much needed evaluation of the Gaussian Process (GP) classifier, as a non-parametric probabilistic analogue to SVMs, which has been rarely applied to text classification. We provide an extensive experimental comparison of the performance and properties of these competing classifiers on the challenging problem of protein interaction detection in biomedical publications. Our results show that GPs can match the performance of SVMs without the need for costly margin parameter tuning, whilst offering the advantage of an extendable probabilistic framework for text classification.

Publication types

  • Evaluation Study

MeSH terms

  • Algorithms*
  • Data Mining / methods*
  • Models, Statistical
  • Normal Distribution
  • Pattern Recognition, Automated / methods
  • Protein Interaction Mapping*
  • Proteins / chemistry
  • Proteins / metabolism

Substances

  • Proteins