Fidelity of capture-enrichment for mtDNA genome sequencing: influence of NUMTs

Nucleic Acids Res. 2012 Oct;40(18):e137. doi: 10.1093/nar/gks499. Epub 2012 May 30.

Abstract

Enriching target sequences in sequencing libraries via capture hybridization to bait/probes is an efficient means of leveraging the capabilities of next-generation sequencing for obtaining sequence data from target regions of interest. However, homologous sequences from non-target regions may also be enriched by such methods. Here we investigate the fidelity of capture enrichment for complete mitochondrial DNA (mtDNA) genome sequencing by analyzing sequence data for nuclear copies of mtDNA (NUMTs). Using capture-enriched sequencing data from a mitochondria-free cell line and the parental cell line, and from samples previously sequenced from long-range PCR products, we demonstrate that NUMT alleles are indeed present in capture-enriched sequence data, but at low enough levels to not influence calling the authentic mtDNA genome sequence. However, distinguishing NUMT alleles from true low-level mutations (e.g. heteroplasmy) is more challenging. We develop here a computational method to distinguish NUMT alleles from heteroplasmies, using sequence data from artificial mixtures to optimize the method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Line
  • Cell Nucleus / genetics*
  • Computer Simulation
  • DNA, Mitochondrial / chemistry*
  • Genome, Mitochondrial*
  • Genomics / methods
  • Humans
  • Mutation
  • Polymerase Chain Reaction
  • Sequence Analysis, DNA*

Substances

  • DNA, Mitochondrial