Predicting protein subcellular location using Chou's pseudo amino acid composition and improved hybrid approach

Protein Pept Lett. 2008;15(6):612-6. doi: 10.2174/092986608784966930.

Abstract

The location of a protein in a cell is closely correlated with its biological function. Based on the concept that the protein subcellular location is mainly determined by its amino acid and pseudo amino acid composition (PseAA), a new algorithm of increment of diversity combined with support vector machine is proposed to predict the protein subcellular location. The subcellular locations of plant and non-plant proteins are investigated by our method. The overall prediction accuracies in jackknife test are 88.3% for the eukaryotic plant proteins and 92.4% for the eukaryotic non-plant proteins, respectively. In order to estimate the effect of the sequence identity on predictive result, the proteins with sequence identity <or=40% are selected. The overall success rates of prediction are 86.2% and 92.3% for plant and non-plant proteins in jackknife test, respectively.

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Eukaryotic Cells / cytology
  • Eukaryotic Cells / metabolism
  • Intracellular Space / metabolism*
  • Plant Cells
  • Plant Proteins / chemistry
  • Plant Proteins / metabolism
  • Plants / metabolism
  • Proteins / chemistry
  • Proteins / metabolism*
  • Sequence Analysis, Protein

Substances

  • Plant Proteins
  • Proteins