ProtPipe: A Multifunctional Data Analysis Pipeline for Proteomics and Peptidomics

Genomics Proteomics Bioinformatics. 2024 Nov 22:qzae083. doi: 10.1093/gpbjnl/qzae083. Online ahead of print.

Abstract

Mass spectrometry (MS) is a technique widely employed for the identification and characterization of proteins, with personalized medicine, systems biology, and biomedical applications. The application of MS-based proteomics advances our understanding of protein function, cellular signaling, and complex biological systems. MS data analysis is a critical process that includes identifying and quantifying proteins and peptides and then exploring their biological functions in downstream analysis. To address the complexities associated with MS data analysis, we developed ProtPipe to streamline and automate the processing and analysis of high-throughput proteomics and peptidomics datasets with DIA-NN preinstalled. The pipeline facilitates data quality control, sample filtering, and normalization, ensuring robust and reliable downstream analyses. ProtPipe provides downstream analyses, including protein and peptide differential abundance identification, pathway enrichment analysis, protein-protein interaction analysis, and Major histocompatibility complex (MHC) -peptide binding affinity analysis. ProtPipe generates annotated tables and visualizations by performing statistical postprocessing and calculating fold changes between predefined pairwise conditions in an experimental design. It is an open-source, well-documented tool available online at https://github.com/NIH-CARD/ProtPipe, with a user-friendly web interface.

Keywords: Data analysis pipeline; Immunopeptidomics; Mass spectrometry; ProtPipe; Proteomics.