Deciphering Protein Secretion from the Brain to Cerebrospinal Fluid for Biomarker Discovery

Katharina Waury; Renske de Wit; Inge M W Verberk; Charlotte E Teunissen; Sanne Abeln

doi:10.1021/acs.jproteome.3c00366

Deciphering Protein Secretion from the Brain to Cerebrospinal Fluid for Biomarker Discovery

J Proteome Res. 2023 Sep 1;22(9):3068-3080. doi: 10.1021/acs.jproteome.3c00366. Epub 2023 Aug 22.

Authors

Katharina Waury¹, Renske de Wit¹, Inge M W Verberk², Charlotte E Teunissen², Sanne Abeln¹

Affiliations

¹ Department of Computer Science, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands.
² Neurochemistry Laboratory, Department of Clinical Chemistry, Amsterdam Neuroscience, VU University Medical Center, Amsterdam UMC, 1081 HV Amsterdam, The Netherlands.

Abstract

Cerebrospinal fluid (CSF) is an essential matrix for the discovery of neurological disease biomarkers. However, the high dynamic range of protein concentrations in CSF hinders the detection of the least abundant protein biomarkers by untargeted mass spectrometry. It is thus beneficial to gain a deeper understanding of the secretion processes within the brain. Here, we aim to explore if and how the secretion of brain proteins to the CSF can be predicted. By combining a curated CSF proteome and the brain elevated proteome of the Human Protein Atlas, brain proteins were classified as CSF or non-CSF secreted. A machine learning model was trained on a range of sequence-based features to differentiate between CSF and non-CSF groups and effectively predict the brain origin of proteins. The classification model achieves an area under the curve of 0.89 if using high confidence CSF proteins. The most important prediction features include the subcellular localization, signal peptides, and transmembrane regions. The classifier generalized well to the larger brain detected proteome and is able to correctly predict novel CSF proteins identified by affinity proteomics. In addition to elucidating the underlying mechanisms of protein secretion, the trained classification model can support biomarker candidate selection.

Keywords: brain proteome; cerebrospinal fluid; fluid biomarker; machine learning; protein secretion.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Biological Transport
Biomedical Research*
Brain
Cerebrospinal Fluid Proteins
Humans
Protein Transport
Proteome*

Substances

Proteome
Cerebrospinal Fluid Proteins