EpicCapo: epitope prediction using combined information of amino acid pairwise contact potentials and HLA-peptide contact site information

BMC Bioinformatics. 2012 Nov 24:13:313. doi: 10.1186/1471-2105-13-313.

Abstract

Background: Epitope identification is an essential step toward synthetic vaccine development since epitopes play an important role in activating immune response. Classical experimental approaches are laborious and time-consuming, and therefore computational methods for generating epitope candidates have been actively studied. Most of these methods, however, are based on sophisticated nonlinear techniques for achieving higher predictive performance. The use of these techniques tend to diminish their interpretability with respect to binding potential: that is, they do not provide much insight into binding mechanisms.

Results: We have developed a novel epitope prediction method named EpicCapo and its variants, EpicCapo(+) and EpicCapo(+REF). Nonapeptides were encoded numerically using a novel peptide-encoding scheme for machine learning algorithms by utilizing 40 amino acid pairwise contact potentials (referred to as AAPPs throughout this paper). The predictive performances of EpicCapo(+) and EpicCapo(+REF) outperformed other state-of-the-art methods without losing interpretability. Interestingly, the most informative AAPPs estimated by our study were those developed by Micheletti and Simons while previous studies utilized two AAPPs developed by Miyazawa & Jernigan and Betancourt & Thirumalai. In addition, we found that all amino acid positions in nonapeptides could effect on performances of the predictive models including non-anchor positions. Finally, EpicCapo(+REF) was applied to identify candidates of promiscuous epitopes. As a result, 67.1% of the predicted nonapeptides epitopes were consistent with preceding studies based on immunological experiments.

Conclusions: Our method achieved high performance in testing with benchmark datasets. In addition, our study identified a number of candidates of promiscuous CTL epitopes consistent with previously reported immunological experiments. We speculate that our techniques may be useful in the development of new vaccines. The R implementation of EpicCapo(+REF) is available at http://pirun.ku.ac.th/~fsciiok/EpicCapoREF.zip. Datasets are available at http://pirun.ku.ac.th/~fsciiok/Datasets.zip.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acids / analysis
  • Epitopes / analysis*
  • Epitopes / chemistry
  • Epitopes / immunology
  • Epitopes / metabolism
  • HLA Antigens / analysis
  • HLA Antigens / chemistry
  • HLA Antigens / immunology
  • HLA Antigens / metabolism
  • Humans
  • Influenza A virus / immunology
  • Influenza Vaccines / immunology
  • Protein Binding
  • Support Vector Machine*
  • T-Lymphocytes, Cytotoxic / immunology

Substances

  • Amino Acids
  • Epitopes
  • HLA Antigens
  • Influenza Vaccines