Creation of a Human Secretome: A Novel Composite Library of Human Secreted Proteins: Validation Using Ovarian Cancer Gene Expression Data and a Virtual Secretome Array

Clin Cancer Res. 2015 Nov 1;21(21):4960-9. doi: 10.1158/1078-0432.CCR-14-3173. Epub 2015 May 5.

Abstract

Purpose: To generate a comprehensive "Secretome" of proteins potentially found in the blood and derive a virtual Affymetrix array. To validate the utility of this database for the discovery of novel serum-based biomarkers using ovarian cancer transcriptomic data.

Experimental design: The secretome was constructed by aggregating the data from databases of known secreted proteins, transmembrane or membrane proteins, signal peptides, G-protein coupled receptors, or proteins existing in the extracellular region, and the virtual array was generated by mapping them to Affymetrix probeset identifiers. Whole-genome microarray data from ovarian cancer, normal ovarian surface epithelium, and fallopian tube epithelium were used to identify transcripts upregulated in ovarian cancer.

Results: We established the secretome from eight public databases and a virtual array consisting of 16,521 Affymetrix U133 Plus 2.0 probesets. Using ovarian cancer transcriptomic data, we identified candidate blood-based biomarkers for ovarian cancer and performed bioinformatic validation by demonstrating rediscovery of known biomarkers including CA125 and HE4. Two novel top biomarkers (FGF18 and GPR172A) were validated in serum samples from an independent patient cohort.

Conclusions: We present the secretome, comprising the most comprehensive resource available for protein products that are potentially found in the blood. The associated virtual array can be used to translate gene-expression data into cancer biomarker discovery. A list of blood-based biomarkers for ovarian cancer detection is reported and includes CA125 and HE4. FGF18 and GPR172A were identified and validated by ELISA as being differentially expressed in the serum of ovarian cancer patients compared with controls.

MeSH terms

  • Biomarkers, Tumor
  • Cluster Analysis
  • Computational Biology / methods
  • Databases, Genetic
  • Female
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Gene Library
  • Humans
  • Ovarian Neoplasms / genetics
  • Ovarian Neoplasms / metabolism*
  • Proteome / metabolism*
  • Proteomics* / methods
  • Reproducibility of Results
  • Signal Transduction
  • Transcriptome

Substances

  • Biomarkers, Tumor
  • Proteome