ProGeo-Neo v2.0: A One-Stop Software for Neoantigen Prediction and Filtering Based on the Proteogenomics Strategy

Genes (Basel). 2022 Apr 28;13(5):783. doi: 10.3390/genes13050783.

Abstract

A proteogenomics-based neoantigen prediction pipeline, namely ProGeo-neo, was previously developed by our team to predict neoantigens, allowing the identification of class-I major histocompatibility complex (MHC) binding peptides based on single-nucleotide variation (SNV) mutations. To improve it, we here present an updated pipeline, i.e., ProGeo-neo v2.0, in which a one-stop software solution was proposed to identify neoantigens based on the paired tumor-normal whole genome sequencing (WGS)/whole exome sequencing (WES) data in FASTQ format. Preferably, in ProGeo-neo v2.0, several new features are provided. In addition to the identification of MHC-I neoantigens, the new version supports the prediction of MHC class II-restricted neoantigens, i.e., peptides up to 30-mer in length. Moreover, the source of neoantigens has been expanded, allowing more candidate neoantigens to be identified, such as in-frame insertion-deletion (indels) mutations, frameshift mutations, and gene fusion analysis. In addition, we propose two more efficient screening approaches, including an in-group authentic neoantigen peptides database and two more stringent thresholds. The range of candidate peptides was effectively narrowed down to those that are more likely to elicit an immune response, providing a more meaningful reference for subsequent experimental validation. Compared to ProGeo-neo, the ProGeo-neo v2.0 performed well based on the same dataset, including updated functionality and improved accuracy.

Keywords: bioinformatics; neoantigen; proteogenomic; tumor immunotherapy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Antigens, Neoplasm / genetics
  • Humans
  • Neoplasms* / genetics
  • Peptides
  • Proteogenomics*
  • Software

Substances

  • Antigens, Neoplasm
  • Peptides

Grants and funding

This work was supported by the National Natural Science Foundation of China under Grant [31870829]; Shanghai Municipal Health Commission Collaborative Innovation Cluster Project under Grant [2019CXJQ02].