Phylogenetic mapping of recombination hotspots in human immunodeficiency virus via spatially smoothed change-point processes

Genetics. 2007 Apr;175(4):1773-85. doi: 10.1534/genetics.106.066258. Epub 2006 Dec 28.

Abstract

We present a Bayesian framework for inferring spatial preferences of recombination from multiple putative recombinant nucleotide sequences. Phylogenetic recombination detection has been an active area of research for the last 15 years. However, only recently have attempts been made to summarize information from several instances of recombination. We propose a hierarchical model that allows for simultaneous inference of recombination breakpoint locations and spatial variation in recombination frequency. The dual multiple change-point model for phylogenetic recombination detection resides at the lowest level of our hierarchy under the umbrella of a common prior on breakpoint locations. The hierarchical prior allows information about spatial preferences of recombination to be shared among individual data sets. To overcome the sparseness of breakpoint data, dictated by the modest number of available recombinant sequences, we a priori impose a biologically relevant correlation structure on recombination location log odds via a Gaussian Markov random field hyperprior. To examine the capabilities of our model to recover spatial variation in recombination frequency, we simulate recombination from a predefined distribution of breakpoint locations. We then proceed with the analysis of 42 human immunodeficiency virus (HIV) intersubtype gag recombinants and identify a putative recombination hotspot.
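The smoothing idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a first-order random-walk form for the Gaussian Markov random field (neighboring sites have similar log odds), a logistic link from log odds to per-site breakpoint probability, and independent breakpoints per site. The function names, the number of sites, and the precision value are all hypothetical; only the count of 42 data sets comes from the abstract.

```python
import math
import random


def gmrf_log_density(log_odds, precision):
    """Unnormalized log density of a first-order random-walk GMRF.

    Assumption: adjacent differences are independent N(0, 1/precision),
    so rough profiles of log odds are penalized more than smooth ones.
    """
    sq_diffs = sum((b - a) ** 2 for a, b in zip(log_odds, log_odds[1:]))
    n = len(log_odds)
    return 0.5 * (n - 1) * math.log(precision) - 0.5 * precision * sq_diffs


def sample_random_walk(n_sites, precision, start, rng):
    """Draw a smooth log-odds profile by accumulating Gaussian increments."""
    sd = 1.0 / math.sqrt(precision)
    profile = [start]
    for _ in range(n_sites - 1):
        profile.append(profile[-1] + rng.gauss(0.0, sd))
    return profile


def breakpoint_probs(log_odds):
    """Map log odds to per-site breakpoint probabilities (logistic link)."""
    return [1.0 / (1.0 + math.exp(-t)) for t in log_odds]


def simulate_breakpoints(probs, n_datasets, rng):
    """Simulate breakpoint sites for each data set from shared probabilities."""
    return [
        [site for site, p in enumerate(probs) if rng.random() < p]
        for _ in range(n_datasets)
    ]


# Hypothetical settings: 50 candidate sites, 42 recombinant data sets.
rng = random.Random(1)
profile = sample_random_walk(n_sites=50, precision=4.0, start=-3.0, rng=rng)
datasets = simulate_breakpoints(breakpoint_probs(profile), n_datasets=42, rng=rng)
```

Because every data set draws its breakpoints from the same probability profile, pooling them is what lets a hierarchical prior of this kind recover spatial preferences that no single recombinant could reveal on its own.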

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Bayes Theorem
  • Chromosome Mapping
  • DNA, Mitochondrial / genetics
  • DNA, Viral / genetics
  • Genome, Viral
  • HIV / genetics*
  • Humans
  • Markov Chains
  • Models, Genetic*
  • Monte Carlo Method
  • Phylogeny
  • Primates / genetics
  • Recombination, Genetic*

Substances

  • DNA, Mitochondrial
  • DNA, Viral