Evolutionary modeling of rate shifts reveals specificity determinants in HIV-1 subtypes

PLoS Comput Biol. 2008 Nov;4(11):e1000214. doi: 10.1371/journal.pcbi.1000214. Epub 2008 Nov 7.

Abstract

A hallmark of the human immunodeficiency virus 1 (HIV-1) is its rapid rate of evolution within and among its various subtypes. Two complementary hypotheses are suggested to explain the sequence variability among HIV-1 subtypes. The first suggests that the functional constraints at each site remain the same across all subtypes, and the differences among subtypes are a direct reflection of random substitutions, which have occurred during the time elapsed since their divergence. The alternative hypothesis suggests that the functional constraints themselves have evolved, and thus sequence differences among subtypes in some sites reflect shifts in function. To determine the contribution of each of these two alternatives to HIV-1 subtype evolution, we have developed a novel Bayesian method for testing and detecting site-specific rate shifts. The RAte Shift EstimatoR (RASER) method determines whether or not site-specific functional shifts characterize the evolution of a protein and, if so, points to the specific sites and lineages in which these shifts have most likely occurred. Applying RASER to a dataset composed of large samples of HIV-1 sequences from different group M subtypes, we reveal rampant evolutionary shifts throughout the HIV-1 proteome. Most of these rate shifts have occurred during the divergence of the major subtypes, establishing that subtype divergence occurred together with functional diversification. We report further evidence for the emergence of a new sub-subtype, characterized by abundant rate-shifting sites. When focusing on the rate-shifting sites detected, we find that many are associated with known function relating to viral life cycle and drug resistance. Finally, we discuss mechanisms of covariation of rate-shifting sites.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adaptation, Biological / genetics*
  • Amino Acid Sequence / physiology
  • Bayes Theorem
  • Drug Resistance, Multiple, Viral / genetics
  • Evolution, Molecular*
  • Genetic Speciation
  • Geography
  • HIV Infections / genetics
  • HIV-1 / genetics*
  • HIV-1 / pathogenicity
  • Humans
  • Models, Biological*
  • Phylogeny
  • Proteomics / methods
  • Time Factors
  • Virus Internalization