De novo detection of somatic variants in high-quality long-read single-cell RNA sequencing data

bioRxiv [Preprint]. 2024 Nov 5:2024.03.06.583775. doi: 10.1101/2024.03.06.583775.

Abstract

In cancer, genetic and transcriptomic variations generate clonal heterogeneity, leading to treatment resistance. Long-read single-cell RNA sequencing (LR scRNA-seq) has the potential to detect genetic and transcriptomic variations simultaneously. Here, we present LongSom, a computational workflow that leverages high-quality LR scRNA-seq data to call de novo somatic single-nucleotide variants (SNVs), including mitochondrial SNVs (mtSNVs), as well as copy-number alterations (CNAs) and gene fusions, in order to reconstruct tumor clonal heterogeneity. Before somatic variant calling, LongSom re-annotates cell types that were initially assigned from marker genes, using the cells' mutational profiles. LongSom distinguishes somatic SNVs from noise and germline polymorphisms by applying an extensive set of hard filters and statistical tests. Applying LongSom to human ovarian cancer samples, we detected clinically relevant somatic SNVs that were validated against matched DNA samples. Leveraging somatic SNVs and fusions, LongSom found subclones with different predicted treatment outcomes. In summary, LongSom enables de novo variant detection without the need for normal samples, facilitating the study of cancer evolution, clonal heterogeneity, and treatment resistance.

Publication types

  • Preprint