A panel sequencing dataset of peripheral blood gene variations in pan-cancer

Sci Data. 2024 Jul 20;11(1):805. doi: 10.1038/s41597-024-03620-6.

Abstract

Circulating cell-free DNA (cfDNA) in the peripheral blood is a promising biomarker for cancer diagnosis and prognosis. Somatic mutations identified in cancers have been used to detect therapeutic targets for clinical transformation and individualize drug selection, while germline variants can predict a patient's risk of developing cancer and drug sensitivity. However, no platform has been developed to analyze, calculate, integrate, and friendly visualize these pan-cancer cfDNA mutations deeply. In this work, we performed panel sequencing encompassing 1,115 cancer-related genes across 16,659 cancer patients, spanning 27 cancer types. We detected 496 germline variants in leukocytes and 11,232 somatic mutations in the cfDNA of all patients. CPGV (Cancer Peripheral blood Gene Variations), a database constructed from this dataset, is the first pan-cancer cfDNA database that encompasses somatic mutations, germline variants, and further comparative analyses of mutations across different cancer types. It bears great promise to serve as a valuable resource for cancer research.

Publication types

  • Dataset

MeSH terms

  • Biomarkers, Tumor / blood
  • Biomarkers, Tumor / genetics
  • Cell-Free Nucleic Acids / blood
  • Cell-Free Nucleic Acids / genetics
  • Databases, Genetic
  • Genetic Variation
  • Germ-Line Mutation
  • Humans
  • Mutation
  • Neoplasms* / blood
  • Neoplasms* / genetics

Substances

  • Cell-Free Nucleic Acids
  • Biomarkers, Tumor