Performance comparison of whole-genome sequencing platforms

Nat Biotechnol. 2011 Dec 18;30(1):78-82. doi: 10.1038/nbt.2065.

Abstract

Whole-genome sequencing is becoming commonplace, but the accuracy and completeness of variant calling by the most widely used platforms from Illumina and Complete Genomics have not been reported. Here we sequenced the genome of an individual with both technologies to a high average coverage of ∼76×, and compared their performance with respect to sequence coverage and calling of single-nucleotide variants (SNVs), insertions and deletions (indels). Although 88.1% of the ∼3.7 million unique SNVs were concordant between platforms, there were tens of thousands of platform-specific calls located in genes and other genomic regions. In contrast, 26.5% of indels were concordant between platforms. Target enrichment validated 92.7% of the concordant SNVs, whereas validation by genotyping array revealed a sensitivity of 99.3%. The validation experiments also suggested that >60% of the platform-specific variants were indeed present in the genome. Our results have important implications for understanding the accuracy and completeness of the genome sequencing platforms.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA, Intergenic / genetics
  • Exons / genetics
  • Genome, Human*
  • Genotype
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • INDEL Mutation / genetics
  • Introns / genetics
  • Polymorphism, Single Nucleotide / genetics*
  • Research Design / standards*
  • Untranslated Regions / genetics

Substances

  • DNA, Intergenic
  • Untranslated Regions