Positive-unlabeled learning identifies vaccine candidate antigens in the malaria parasite Plasmodium falciparum

NPJ Syst Biol Appl. 2024 Apr 27;10(1):44. doi: 10.1038/s41540-024-00365-1.

Abstract

Malaria vaccine development is hampered by extensive antigenic variation and complex life stages of Plasmodium species. Vaccine development has focused on a small number of antigens, many of which were identified without utilizing systematic genome-level approaches. In this study, we implement a machine learning-based reverse vaccinology approach to predict potential new malaria vaccine candidate antigens. We assemble and analyze P. falciparum proteomic, structural, functional, immunological, genomic, and transcriptomic data, and use positive-unlabeled learning to predict potential antigens based on the properties of known antigens and remaining proteins. We prioritize candidate antigens based on model performance on reference antigens with different genetic diversity and quantify the protein properties that contribute most to identifying top candidates. Candidate antigens are characterized by gene essentiality, gene ontology, and gene expression in different life stages to inform future vaccine development. This approach provides a framework for identifying and prioritizing candidate vaccine antigens for a broad range of pathogens.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Antigens, Protozoan* / genetics
  • Antigens, Protozoan* / immunology
  • Computational Biology / methods
  • Humans
  • Machine Learning
  • Malaria Vaccines* / immunology
  • Malaria, Falciparum* / immunology
  • Malaria, Falciparum* / prevention & control
  • Plasmodium falciparum* / genetics
  • Plasmodium falciparum* / immunology
  • Proteomics / methods
  • Protozoan Proteins / genetics
  • Protozoan Proteins / immunology
  • Vaccine Development / methods

Substances

  • Malaria Vaccines
  • Antigens, Protozoan
  • Protozoan Proteins