Residual tissue repositories as a resource for population-based cancer proteomic studies

Clin Proteomics. 2018 Aug 3:15:26. doi: 10.1186/s12014-018-9202-4. eCollection 2018.

Abstract

Background: Mass spectrometry-based proteomics has become a powerful tool for the identification and quantification of proteins from a wide variety of biological specimens. To date, the majority of studies utilizing tissue samples have been carried out on prospectively collected fresh frozen or optimal cutting temperature (OCT) embedded specimens. However, such specimens are often difficult to obtain, in limited in supply, and clinical information and outcomes on patients are inherently delayed as compared to banked samples. Annotated formalin fixed, paraffin embedded (FFPE) tumor tissue specimens are available for research use from a variety of tissue banks, such as from the surveillance, epidemiology and end results (SEER) registries' residual tissue repositories. Given the wealth of outcomes information associated with such samples, the reuse of archived FFPE blocks for deep proteomic characterization with mass spectrometry technologies would provide a valuable resource for population-based cancer studies. Further, due to the widespread availability of FFPE specimens, validation of specimen integrity opens the possibility for thousands of studies that can be conducted worldwide.

Methods: To examine the suitability of the SEER repository tissues for proteomic and phosphoproteomic analysis, we analyzed 60 SEER patient samples, with time in storage ranging from 7 to 32 years; 60 samples with expression proteomics and 18 with phosphoproteomics, using isobaric labeling. Linear modeling and gene set enrichment analysis was used to evaluate the impacts of collection site and storage time.

Results: All samples, regardless of age, yielded suitable protein mass after extraction for expression analysis and 18 samples yielded sufficient mass for phosphopeptide analysis. Although peptide, protein, and phosphopeptide identifications were reduced by 50, 20 and 76% respectively, from comparable OCT specimens, we found no statistically significant differences in protein quantitation correlating with collection site or specimen age. GSEA analysis of GO-term level measurements of protein abundance differences between FFPE and OCT embedded specimens suggest that the formalin fixation process may alter representation of protein categories in the resulting dataset.

Conclusions: These studies demonstrate that residual FFPE tissue specimens, of varying age and collection site, are a promising source of protein for proteomic investigations if paired with rigorously verified mass spectrometry workflows.

Keywords: Formalin fixed paraffin embedded; Phosphoproteomics; Proteomics; Surveillance, epidemiology and end results; Tandem mass tags.