Gene conversion and the evolution of protocadherin gene cluster diversity

Genome Res. 2004 Mar;14(3):354-66. doi: 10.1101/gr.2133704.

Abstract

The synaptic cell adhesion molecules encoded by the protocadherin gene cluster are hypothesized to provide a molecular code involved in the generation of synaptic complexity in the developing brain. Variation in copy number and sequence content of protocadherin cluster genes among vertebrate species could reflect adaptive differences in protocadherin function. We have completed an analysis of zebrafish protocadherin cluster genes. Zebrafish have two unlinked protocadherin clusters, DrPcdh1 and DrPcdh2. Like mammalian protocadherin clusters, DrPcdh1 has both alpha and gamma variable and constant region exons. A consensus protocadherin promoter motif sequence identified in mammals is also conserved in zebrafish. Few orthologous relationships, however, are apparent between zebrafish and mammalian protocadherin proteins. Here we show that protocadherin cluster genes in human, mouse, rat, and zebrafish are subject to striking gene conversion events. These events are restricted to regions of the coding sequence, particularly the coding sequences of ectodomain 6 and the cytoplasmic domain. Diversity among paralogs is restricted to particular ectodomains that are excluded from conversion events. Conversion events are also strongly correlated with an increase in third-position GC content. We propose that the combination of lineage-specific duplication, restricted gene conversion, and adaptive variation in diversified ectodomains drives vertebrate protocadherin cluster evolution.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Cadherins / genetics*
  • Computational Biology / methods
  • Computational Biology / statistics & numerical data
  • Evolution, Molecular*
  • Gene Conversion / genetics*
  • Genetic Variation / genetics*
  • Humans
  • Mice
  • Molecular Sequence Data
  • Multigene Family / genetics*
  • Phylogeny
  • Rats
  • Zebrafish / genetics
  • Zebrafish Proteins / genetics

Substances

  • Cadherins
  • Zebrafish Proteins

Associated data

  • GENBANK/AC144823
  • GENBANK/AC144826
  • GENBANK/AC144828
  • GENBANK/AC146480