Error rates, PCR recombination, and sampling depth in HIV-1 whole genome deep sequencing

Virus Res. 2017 Jul 15:239:106-114. doi: 10.1016/j.virusres.2016.12.009. Epub 2016 Dec 27.

Abstract

Deep sequencing is a powerful and cost-effective tool to characterize the genetic diversity and evolution of virus populations. While modern sequencing instruments readily cover viral genomes many thousand fold and very rare variants can in principle be detected, sequencing errors, amplification biases, and other artifacts can limit sensitivity and complicate data interpretation. For this reason, the number of studies using whole genome deep sequencing to characterize viral quasi-species in clinical samples is still limited. We have previously undertaken a large scale whole genome deep sequencing study of HIV-1 populations. Here we discuss the challenges, error profiles, control experiments, and computational test we developed to quantify the accuracy of variant frequency estimation.

Keywords: Amplification bias; Indel errors; Population sequencing.

Publication types

  • Review

MeSH terms

  • Computational Biology / methods
  • Genetic Variation*
  • Genome, Viral*
  • Genotype
  • HIV Infections / virology*
  • HIV-1 / genetics*
  • Humans
  • INDEL Mutation
  • Polymerase Chain Reaction* / methods
  • Polymerase Chain Reaction* / standards
  • Recombination, Genetic*
  • Reproducibility of Results