Genome-wide direct quantification of in vivo mutagenesis using high-accuracy paired-end and complementary consensus sequencing

Nucleic Acids Res. 2023 Nov 27;51(21):e109. doi: 10.1093/nar/gkad909.

Abstract

Error-corrected next-generation sequencing (ecNGS) is an emerging technology for accurately measuring somatic mutations. Here, we report paired-end and complementary consensus sequencing (PECC-Seq), a high-accuracy ecNGS approach for genome-wide somatic mutation detection. We characterize a novel 2-aminoimidazolone lesion besides 7,8-dihydro-8-oxoguanine and the resulting end-repair artifacts originating from NGS library preparation that obscure the sequencing accuracy of NGS. We modify library preparation protocol for the enzymatic removal of end-repair artifacts and improve the accuracy of our previously developed duplex consensus sequencing method. Optimized PECC-Seq shows an error rate of <5 × 10-8 with consensus bases compressed from approximately 25 Gb of raw sequencing data, enabling the accurate detection of low-abundance somatic mutations. We apply PECC-Seq to the quantification of in vivo mutagenesis. Compared with the classic gpt gene mutation assay using gpt delta transgenic mice, PECC-Seq exhibits high sensitivity in quantitatively measuring dose-dependent mutagenesis induced by Aristolochic acid I (AAI). Moreover, PECC-Seq specifically characterizes the distinct genome-wide mutational signatures of AAI, Benzo[a]pyrene, N-Nitroso-N-ethylurea and N-nitrosodiethylamine and reveals the mutational signature of Quinoline in common mouse models. Overall, our findings demonstrate that high-accuracy PECC-Seq is a promising tool for genome-wide somatic mutagenesis quantification and for in vivo mutagenicity testing.

MeSH terms

  • Animals
  • Consensus
  • High-Throughput Nucleotide Sequencing* / methods
  • Male
  • Mice
  • Mice, Transgenic
  • Mutagenesis / genetics
  • Mutation
  • Sequence Analysis, DNA / methods