Prediction of subcellular location apoptosis proteins with ensemble classifier and feature selection

Amino Acids. 2010 Apr;38(4):975-83. doi: 10.1007/s00726-008-0209-4. Epub 2008 Dec 2.

Abstract

Apoptosis proteins have a central role in the development and the homeostasis of an organism. These proteins are very important for understanding the mechanism of programmed cell death. The function of an apoptosis protein is closely related to its subcellular location. It is crucial to develop powerful tools to predict apoptosis protein locations for rapidly increasing gap between the number of known structural proteins and the number of known sequences in protein databank. In this study, amino acids pair compositions with different spaces are used to construct feature sets for representing sample of protein feature selection approach based on binary particle swarm optimization, which is applied to extract effective feature. Ensemble classifier is used as prediction engine, of which the basic classifier is the fuzzy K-nearest neighbor. Each basic classifier is trained with different feature sets. Two datasets often used in prior works are selected to validate the performance of proposed approach. The results obtained by jackknife test are quite encouraging, indicating that the proposed method might become a potentially useful tool for subcellular location of apoptosis protein, or at least can play a complimentary role to the existing methods in the relevant areas. The supplement information and software written in Matlab are available by contacting the corresponding author.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Animals
  • Apoptosis Regulatory Proteins / chemistry*
  • Apoptosis Regulatory Proteins / classification
  • Apoptosis Regulatory Proteins / metabolism*
  • Computational Biology / methods*
  • Databases, Protein
  • Expert Systems
  • Fuzzy Logic
  • Humans
  • Sequence Analysis, Protein / methods*
  • Software
  • Subcellular Fractions / metabolism

Substances

  • Apoptosis Regulatory Proteins