Cancer of unknown primary (CUP) constitutes between 2% and 5% of human malignancies and is among the most common causes of cancer death in the United States. Brain metastases are often the first clinical presentation of CUP; despite extensive pathological and imaging studies, 20%-45% of CUP are never assigned a primary site. DNA methylation array profiling is a reliable method for tumor classification but tumor-type-specific classifier development requires many reference samples. This is difficult to accomplish for CUP as many cases are never assigned a specific diagnosis. Recent studies identified subsets of methylation quantitative trait loci (mQTLs) unique to specific organs, which could help increase classifier accuracy while requiring fewer samples. We performed a retrospective genome-wide methylation analysis of 759 carcinoma samples from formalin-fixed paraffin-embedded tissue samples using Illumina EPIC array. Utilizing mQTL specific for breast, lung, ovarian/gynecologic, colon, kidney, or testis (BLOCKT) (185k total probes), we developed a deep learning-based methylation classifier that achieved 93.12% average accuracy and 93.04% average F1-score across a 10-fold validation for BLOCKT organs. Our findings indicate that our organ-based DNA methylation classifier can assist pathologists in identifying the site of origin, providing oncologists insight on a diagnosis to administer appropriate therapy, improving patient outcomes.
Keywords: DNA methylation; EPIC array; cancer of unknown primary; classifier; deep learning; metastasis; molecular pathology.
© The Author(s) 2024. Published by Oxford University Press on behalf of American Association of Neuropathologists, Inc.