Improved prediction of allergenicity by combination of multiple sequence motifs

In Silico Biol. 2007;7(1):77-86.

Abstract

The identification and validation of protein allergens have become more important nowadays as more and more transgenic proteins are introduced into our food chains. Current allergen prediction algorithms focus on the identification of single motif or single allergen peptide for allergen detection. However, an analysis of the 575 allergen dataset shows that most allergens contain multiple motifs. Here, we present a novel algorithm that detects allergen by making use of combinations of motifs. Sensitivity of 0.772 and specificity of 0.904 were achieved by the proposed algorithm to predict allergen. The specificity of the proposed approach is found to be significantly higher than traditional single motif approaches. The high specificity of the proposed algorithm is useful in filtering out false positives, especially when laboratory resources are limited.

MeSH terms

  • Algorithms
  • Allergens / chemistry*
  • Amino Acid Motifs
  • Amino Acid Sequence
  • Genes, Plant
  • Molecular Sequence Data
  • Pattern Recognition, Automated
  • Plant Proteins / chemistry
  • Plants, Genetically Modified
  • Protein Structure, Tertiary
  • Sensitivity and Specificity
  • Sequence Analysis, Protein
  • Software

Substances

  • Allergens
  • Plant Proteins