Background: It is possible to identify thousands of phosphopeptides and -proteins in a single experiment with mass spectrometry-based phosphoproteomics. However, a current bottleneck is the downstream data analysis which is often laborious and requires a number of manual steps.
Results: Toward automating the analysis steps, we have developed and implemented a software, PhosFox, which enables peptide-level processing of phosphoproteomic data generated by multiple protein identification search algorithms, including Mascot, Sequest, and Paragon, as well as cross-comparison of their identification results. The software supports both qualitative and quantitative phosphoproteomics studies, as well as multiple between-group comparisons. Importantly, PhosFox detects uniquely phosphorylated peptides and proteins in one sample compared to another. It also distinguishes differences in phosphorylation sites between phosphorylated proteins in different samples. Using two case study examples, a qualitative phosphoproteome dataset from human keratinocytes and a quantitative phosphoproteome dataset from rat kidney inner medulla, we demonstrate here how PhosFox facilitates an efficient and in-depth phosphoproteome data analysis. PhosFox was implemented in the Perl programming language and it can be run on most common operating systems. Due to its flexible interface and open source distribution, the users can easily incorporate the program into their MS data analysis workflows and extend the program with new features. PhosFox source code, implementation and user instructions are freely available from https://bitbucket.org/phintsan/phosfox.
Conclusions: PhosFox facilitates efficient and more in-depth comparisons between phosphoproteins in case-control settings. The open source implementation is easily extendable to accommodate additional features for widespread application use cases.
Keywords: Data processing and analysis; Database searching; LC-MS/MS; Phosphoproteomics.