Background: Human immunodeficiency virus type 1 (HIV-1) genetic diversity increases over the course of infection and can be used to infer the time since infection and, consequently, infection recency, which are crucial for HIV-1 surveillance and the understanding of viral pathogenesis.
Methods: We considered 313 HIV-infected individuals for whom reliable estimates of infection dates and next-generation sequencing (NGS)-derived nucleotide frequency data were available. Fractions of ambiguous nucleotides, obtained by population sequencing, were available for 207 samples. We assessed whether the average pairwise diversity calculated using NGS sequences provided a more exact prediction of the time since infection and classification of infection recency (<1 year after infection), compared with the fraction of ambiguous nucleotides.
Results: NGS-derived average pairwise diversity classified an infection as recent with a sensitivity of 88% and a specificity of 85%. When considering only the 207 samples for which fractions of ambiguous nucleotides were available, the NGS-derived average pairwise diversity exhibited a higher sensitivity (90% vs 78%) and specificity (95% vs 67%) than the fraction of ambiguous nucleotides. Additionally, the average pairwise diversity could be used to estimate the time since infection with a mean absolute error of 0.84 years, compared with 1.03 years for the fraction of ambiguous nucleotides.
Conclusions: Viral diversity based on NGS data is more precise than that based on population sequencing in its ability to predict infection recency and provides an estimated time since infection that has a mean absolute error of <1 year.
Keywords: HIV-1; diversity; infection recency; next-generation sequencing; time since infection.
© The Author(s) 2019. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail: [email protected].