Novel insights from the Plasmodium falciparum sporozoite-specific proteome by probabilistic integration of 26 studies

PLoS Comput Biol. 2021 Apr 30;17(4):e1008067. doi: 10.1371/journal.pcbi.1008067. eCollection 2021 Apr.

Abstract

Plasmodium species, the causative agent of malaria, have a complex life cycle involving two hosts. The sporozoite life stage is characterized by an extended phase in the mosquito salivary glands followed by free movement and rapid invasion of hepatocytes in the human host. This transmission stage has been the subject of many transcriptomics and proteomics studies and is also targeted by the most advanced malaria vaccine. We applied Bayesian data integration to determine which proteins are not only present in sporozoites but are also specific to that stage. Transcriptomic and proteomic Plasmodium data sets from 26 studies were weighted for how representative they are for sporozoites, based on a carefully assembled gold standard for Plasmodium falciparum (Pf) proteins known to be present or absent during the sporozoite life stage. Of 5418 Pf genes for which expression data were available at the RNA level or at the protein level, 975 were identified as enriched in sporozoites and 90 specific to them. We show that Pf sporozoites are enriched for proteins involved in type II fatty acid synthesis in the apicoplast and GPI anchor synthesis, but otherwise appear metabolically relatively inactive in the salivary glands of mosquitos. Newly annotated hypothetical sporozoite-specific and sporozoite-enriched proteins highlight sporozoite-specific functions. They include PF3D7_0104100 that we identified to be homologous to the prominin family, which in human has been related to a quiescent state of cancer cells. We document high levels of genetic variability for sporozoite proteins, specifically for sporozoite-specific proteins that elicit antibodies in the human host. Nevertheless, we can identify nine relatively well-conserved sporozoite proteins that elicit antibodies and that together can serve as markers for previous exposure. Our understanding of sporozoite biology benefits from identifying key pathways that are enriched during this life stage. This work can guide studies of molecular mechanisms underlying sporozoite biology and potential well-conserved targets for marker and drug development.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bayes Theorem
  • Plasmodium falciparum / genetics
  • Plasmodium falciparum / metabolism*
  • Polymorphism, Single Nucleotide
  • Probability
  • Proteome*
  • Protozoan Proteins / metabolism*
  • Sporozoites / metabolism*
  • Transcriptome

Substances

  • Proteome
  • Protozoan Proteins

Grants and funding

A.Y. is supported by a Veni grant (VI.Veni.192.171). LM-K was funded by a grant from the Radboud University Nijmegen Medical Centre. NIP, RS and AY are supported by the European Union's Horizon 2020 research and innovation program under grant agreement No. 733273. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.