Comparative DNA sequence analysis of mouse and human protocadherin gene clusters

Genome Res. 2001 Mar;11(3):389-404. doi: 10.1101/gr.167301.

Abstract

The genomic organization of the human protocadherin alpha, beta, and gamma gene clusters (designated Pcdh alpha [gene symbol PCDHA], Pcdh beta [PCDHB], and Pcdh gamma [PCDHG]) is remarkably similar to that of immunoglobulin and T-cell receptor genes. The extracellular and transmembrane domains of each protocadherin protein are encoded by an unusually large "variable" region exon, while the intracellular domains are encoded by three small "constant" region exons located downstream from a tandem array of variable region exons. Here we report the results of a comparative DNA sequence analysis of the orthologous human (750 kb) and mouse (900 kb) protocadherin gene clusters. The organization of Pcdh alpha and Pcdh gamma gene clusters in the two species is virtually identical, whereas the mouse Pcdh beta gene cluster is larger and contains more genes than the human Pcdh beta gene cluster. We identified conserved DNA sequences upstream of the variable region exons, and found that these sequences are more conserved between orthologs than between paralogs. Within this region, there is a highly conserved DNA sequence motif located at about the same position upstream of the translation start codon of each variable region exon. In addition, the variable region of each gene cluster contains a rich array of CpG islands, whose location corresponds to the position of each variable region exon. These observations are consistent with the proposal that the expression of each variable region exon is regulated by a distinct promoter, which is highly conserved between orthologous variable region exons in mouse and human.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Base Composition
  • Cadherins / genetics*
  • Cadherins / isolation & purification
  • Carrier Proteins / genetics
  • Chromosome Mapping
  • Conserved Sequence
  • CpG Islands / genetics
  • Evolution, Molecular
  • Exons / genetics
  • Genetic Variation
  • Humans
  • Mice
  • Molecular Sequence Data
  • Multigene Family / genetics*
  • Phylogeny
  • Protein Precursors / genetics*
  • Protein Precursors / isolation & purification
  • Sequence Analysis, DNA / methods*
  • Transcription Factors / genetics

Substances

  • Cadherins
  • Carrier Proteins
  • Protein Precursors
  • Transcription Factors

Associated data

  • GENBANK/AF332005
  • GENBANK/AF332006
  • GENBANK/AY013756
  • GENBANK/AY013757
  • GENBANK/AY013758
  • GENBANK/AY013759
  • GENBANK/AY013760
  • GENBANK/AY013761
  • GENBANK/AY013762
  • GENBANK/AY013763
  • GENBANK/AY013764
  • GENBANK/AY013765
  • GENBANK/AY013766
  • GENBANK/AY013767
  • GENBANK/AY013768
  • GENBANK/AY013769
  • GENBANK/AY013770
  • GENBANK/AY013771
  • GENBANK/AY013772
  • GENBANK/AY013773
  • GENBANK/AY013774
  • GENBANK/AY013775
  • GENBANK/AY013776
  • GENBANK/AY013777
  • GENBANK/AY013778
  • GENBANK/AY013779
  • GENBANK/AY013780
  • GENBANK/AY013781
  • GENBANK/AY013782
  • GENBANK/AY013783
  • GENBANK/AY013784
  • GENBANK/AY013785
  • GENBANK/AY013786
  • GENBANK/AY013787
  • GENBANK/AY013788
  • GENBANK/AY013789
  • GENBANK/AY013790
  • GENBANK/AY013791
  • GENBANK/AY013792
  • GENBANK/AY013793
  • GENBANK/AY013794
  • GENBANK/AY013795
  • GENBANK/AY013796
  • GENBANK/AY013797
  • GENBANK/AY013798
  • GENBANK/AY013799
  • GENBANK/AY013800
  • GENBANK/AY013801
  • GENBANK/AY013802
  • GENBANK/AY013803
  • GENBANK/AY013804
  • GENBANK/AY013805
  • GENBANK/AY013806
  • GENBANK/AY013807
  • GENBANK/AY013808
  • GENBANK/AY013809
  • GENBANK/AY013810
  • GENBANK/AY013811
  • GENBANK/AY013812
  • GENBANK/AY013813
  • GENBANK/AY013873
  • GENBANK/AY013874
  • GENBANK/AY013875
  • GENBANK/AY013876
  • GENBANK/AY013877
  • GENBANK/AY013878