Improving Accuracy of Somatic Mutation Profiling in Large Epidemiologic Studies: Addressing Cases without Matched Normal Samples

bioRxiv [Preprint]. 2024 Oct 31:2024.10.28.617052. doi: 10.1101/2024.10.28.617052.

Abstract

Ideally, detection of somatic mutations in a tumor is accomplished using a patient-matched sample of normal cells as the benchmark. In this way somatic mutations can be distinguished from rare germline mutations. In large retrospective studies, archival tissue collection can pose challenges in obtaining samples of normal DNA. In this article we propose a protocol that improves somatic mutation analysis in the absence of a matched normal sample. The method was motivated by the InterMEL study, a large-scale epidemiologic investigation involving multiomic, multi-institutional genomic profiling of 1000 primary melanoma samples. The key insight for accomplishing improved mutation calling is the fact that germline mutations should produce a variant allele frequency (VAF) of around 50%. While a similar VAF of 50% would also be expected for somatic mutations in pure tumor samples, typically the tumor purity is much less than 50%, resulting in a considerably lower VAF. Making use of a technique that can simultaneously estimate both tumor purity and VAF from tumor-only samples we have developed a method for better distinguishing somatic versus germline variants. Based on 137 melanomas from the InterMEL Study with matched normal tissue to provide a gold standard we show that the conventional pipeline using a panel of (unmatched) normal samples has a false positive rate of 15.6% and a false negative rate of 3.5%. Our new technique improves these error rates to 6.4% and 2.1%, respectively.

Publication types

  • Preprint