ProtPipe: A Multifunctional Data Analysis Pipeline for Proteomics and Peptidomics

bioRxiv [Preprint]. 2023 Dec 13:2023.12.12.571327. doi: 10.1101/2023.12.12.571327.

Abstract

Mass spectrometry (MS) is a technique widely employed for the identification and characterization of proteins, personalized medicine, systems biology and biomedical applications. By combining MS with different proteomics approaches such as immunopurification MS, immunopeptidomics, and total protein proteomics, researchers can gain insights into protein-protein interactions, immune responses, cellular processes, and disease mechanisms. The application of MS-based proteomics in these areas continues to advance our understanding of protein function, cellular signaling, and complex biological systems. Data analysis for mass spectrometry is a critical process that includes identifying and quantifying proteins and peptides and exploring biological functions for these proteins in downstream analysis. To address the complexities associated with MS data analysis, we developed ProtPipe to streamline and automate the processing and analysis of high-throughput proteomics and peptidomics datasets. The pipeline facilitates data quality control, sample filtering, and normalization, ensuring robust and reliable downstream analysis. ProtPipe provides downstream analysis including identifying differential abundance proteins and peptides, pathway enrichment analysis, protein-protein interaction analysis, and MHC1-peptide binding affinity. ProtPipe generates annotated tables and diagnostic visualizations from statistical postprocessing and computation of fold-changes across pairwise conditions, predefined in an experimental design. ProtPipe is well-documented open-source software and is available at https://github.com/NIH-CARD/ProtPipe , accompanied by a web interface.

Publication types

  • Preprint