Proteome-wide non-cleavable crosslink identification with MS Annika 3.0 reveals the structure of the C. elegans Box C/D complex

Micha J Birklbauer; Fränze Müller; Sowmya Sivakumar Geetha; Manuel Matzinger; Karl Mechtler; Viktoria Dorfer

doi:10.1038/s42004-024-01386-x

Proteome-wide non-cleavable crosslink identification with MS Annika 3.0 reveals the structure of the C. elegans Box C/D complex

Commun Chem. 2024 Dec 19;7(1):300. doi: 10.1038/s42004-024-01386-x.

Authors

Micha J Birklbauer^{1

2}, Fränze Müller³, Sowmya Sivakumar Geetha^{4

5

6}, Manuel Matzinger³, Karl Mechtler^{3

7

8}, Viktoria Dorfer⁹

Affiliations

¹ Bioinformatics Research Group, University of Applied Sciences Upper Austria, Softwarepark 11, Hagenberg, 4232, Austria. [email protected].
² Institute for Symbolic Artificial Intelligence, Johannes Kepler University Linz, Altenberger Straße 69, Linz, 4040, Austria. [email protected].
³ Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, Vienna, 1030, Austria.
⁴ Max Perutz Labs (MPL), Vienna BioCenter (VBC), Dr. Bohr-Gasse 9/Vienna Biocenter 5, Vienna, 1030, Austria.
⁵ Max Perutz Labs (MPL), Department of Chromosome Biology, University of Vienna, Dr. Bohr-Gasse 9/Vienna Biocenter 5, Vienna, 1030, Austria.
⁶ Vienna BioCenter PhD Program, a Doctoral School of the University of Vienna and the Medical University of Vienna, Vienna BioCenter (VBC), Dr. Bohr-Gasse 9/Vienna Biocenter 5, Vienna, 1030, Austria.
⁷ Institute of Molecular Biotechnology (IMBA), Austrian Academy of Sciences, Vienna BioCenter (VBC), Dr. Bohr-Gasse 3, Vienna, 1030, Austria.
⁸ Gregor Mendel Institute (GMI), Austrian Academy of Sciences, Vienna BioCenter (VBC), Dr. Bohr-Gasse 3, Vienna, 1030, Austria.
⁹ Bioinformatics Research Group, University of Applied Sciences Upper Austria, Softwarepark 11, Hagenberg, 4232, Austria. [email protected].

Abstract

The field of crosslinking mass spectrometry has seen substantial advancements over the past decades, enabling the structural analysis of proteins and protein complexes and serving as a powerful tool in protein-protein interaction studies. However, data analysis of large non-cleavable crosslink studies is still a mostly unsolved problem due to its n-squared complexity. We here introduce an algorithm for the identification of non-cleavable crosslinks implemented in our crosslinking search engine MS Annika that is based on sparse matrix multiplication and allows for proteome-wide searches on commodity hardware. We compare our algorithm to other state-of-the-art crosslinking search engines commonly used in the field and conclude that MS Annika unifies high sensitivity, accurate FDR estimation and computational performance, outperforming competing tools. Application of this algorithm enabled us to employ a proteome-wide search of C. elegans nuclei samples, where we were able to uncover previously unknown protein interactions and conclude a comprehensive structural analysis that provides a detailed view of the Box C/D complex. Moreover, our algorithm will enable researchers to conduct similar studies that were previously unfeasible.

Abstract

Grants and funding