Dynamics of a human interparalog gene conversion hotspot

Genome Res. 2004 May;14(5):835-44. doi: 10.1101/gr.2177404.

Abstract

Gene conversion between paralogs can alter their patterns of sequence identity, thus obscuring their evolutionary relationships and affecting their propensity to sponsor genomic rearrangements. The details of this important process are poorly understood in the human genome because allelic diversity complicates the interpretation of interparalog sequence differences. Here we exploit the haploid nature of the Y chromosome, which obviates complicating interallelic processes, together with its known phylogeny, to understand the dynamics of conversion between two directly repeated HERVs flanking the 780-kb AZFa region on Yq. Sequence analysis of a 787-bp segment of each of the HERVs in 36 Y chromosomes revealed one of the highest nucleotide diversities in the human genome, as well as evidence of a complex patchwork of highly directional gene conversion events. The rate of proximal-to-distal conversion events was estimated as 2.4 × 10⁻⁴ to 1.2 × 10⁻³ per generation (3.9 × 10⁻⁷ to 1.9 × 10⁻⁶ per base per generation), and the distal-to-proximal rate as about one-twentieth of this. Minimum observed conversion tract lengths ranged from 1 to 158 bp and maximum lengths from 19 to 1365 bp, with an estimated mean of 31 bp. Analysis of great ape homologs shows that conversion in this hotspot has a deep evolutionary history.
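
The rate figures above can be checked with simple arithmetic. The following is a minimal sketch, not the authors' method. It treats "about one-twentieth" as exactly 1/20, which is an assumption, and all variable and function names are illustrative. It also reports the ratio of the per-generation to the per-base figures. That ratio is only what the quoted numbers imply, since the abstract does not state the denominator used for the per-base rate.

```python
# Proximal-to-distal conversion rates as quoted in the abstract
# (lower bound, upper bound), in events per generation.
P2D_PER_GEN = (2.4e-4, 1.2e-3)
# The same bounds as quoted per base per generation.
P2D_PER_BASE = (3.9e-7, 1.9e-6)
# ASSUMPTION: "about one-twentieth" is taken as exactly 1/20.
D2P_FRACTION = 1 / 20


def scale(bounds, factor):
    """Multiply both bounds of a (low, high) interval by a factor."""
    return tuple(b * factor for b in bounds)


def implied_span_bp(per_gen, per_base):
    """Ratio of per-generation to per-base rate, bound by bound.

    This is only the number of bases implied by the two quoted
    figures, not a value stated in the abstract.
    """
    return tuple(g / b for g, b in zip(per_gen, per_base))


# Approximate distal-to-proximal rates under the 1/20 assumption.
d2p_per_gen = scale(P2D_PER_GEN, D2P_FRACTION)
d2p_per_base = scale(P2D_PER_BASE, D2P_FRACTION)
span = implied_span_bp(P2D_PER_GEN, P2D_PER_BASE)

print("distal-to-proximal per generation: %.1e to %.1e" % d2p_per_gen)
print("distal-to-proximal per base per generation: %.1e to %.1e" % d2p_per_base)
print("implied span (bp): %.0f to %.0f" % span)
```

Under these assumptions the distal-to-proximal rate comes out at roughly 1.2 × 10⁻⁵ to 6.0 × 10⁻⁵ per generation. The implied span is a little over 600 bp at both bounds, which is shorter than the 787-bp sequenced segment. The reason for that difference cannot be determined from the abstract alone.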

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence / genetics
  • Chromosome Mapping / methods
  • Conserved Sequence / genetics
  • DNA / genetics
  • DNA, Viral / genetics
  • Databases, Genetic
  • Endogenous Retroviruses / genetics
  • Evolution, Molecular
  • Gene Conversion / genetics*
  • Genetic Variation / genetics
  • Gorilla gorilla / genetics
  • Humans
  • Male
  • Molecular Sequence Data
  • Nucleic Acid Amplification Techniques / methods
  • Pan troglodytes / genetics
  • Phylogeny
  • Recombination, Genetic / genetics
  • Sequence Alignment
  • Sequence Homology, Nucleic Acid
  • Y Chromosome / genetics
  • Y Chromosome / virology

Substances

  • DNA, Viral
  • DNA

Associated data

  • GENBANK/AY500148
  • GENBANK/AY500149
  • GENBANK/AY500150
  • GENBANK/AY500151
  • GENBANK/AY549481
  • GENBANK/AY549482
  • GENBANK/AY549483
  • GENBANK/AY549484
  • GENBANK/AY549485
  • GENBANK/AY549486
  • GENBANK/AY549487
  • GENBANK/AY549488
  • GENBANK/AY549489
  • GENBANK/AY549490
  • GENBANK/AY549491