Structure of the gene for cartilage matrix protein, a modular protein of the extracellular matrix. Exon/intron organization, unusual splice sites, and relation to alpha chains of beta 2 integrins, von Willebrand factor, complement factors B and C2, and epidermal growth factor

J Biol Chem. 1989 May 15;264(14):8126-34.

Abstract

The entire gene for chicken cartilage matrix protein (CMP) has been isolated and characterized by restriction mapping, electron microscopy, nuclease S1 mapping, and sequence analysis. The gene, which is present in a single copy in the chicken genome, is 18 kilobase pairs long and comprises eight exons and seven introns. It has two transcription initiation sites, 8 base pairs apart. The promoter region contains a sequence highly homologous to the consensus nuclear factor III binding site, as well as a CAT-like and a TATA-like sequence, and ATTAAA is used as the polyadenylation signal. The nucleotide sequence defines a primary translation product of 493 amino acids, which consists of a 23-amino acid signal peptide and two large repeated domains connected by an epidermal growth factor module. Amino acid sequences homologous to those of the repeated domains are present in the type A repeats of von Willebrand factor, complement factors B and C2, and in the alpha chains of the integrins Mac-1, p150,95, and LFA-1. The exon-intron structure indicates that the CMP gene may have arisen by exon duplication and exon shuffling during evolution. The GT-AG splice rule cannot be applied to the excision of the last intron of the CMP pre-mRNA. The donor splice site of intron G differs fundamentally from the consensus sequence, indicating that a novel type of splicing mechanism might exist in cartilage.
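
The GT-AG rule mentioned in the abstract states that introns typically begin with the dinucleotide GT and end with AG on the sense strand. As a minimal illustration only, the Python sketch below checks a sequence against that rule. The function name and the example sequences are hypothetical and are not taken from the CMP gene or from the GenBank entries listed in this record.

```python
def follows_gt_ag_rule(intron: str) -> bool:
    """Return True if an intron starts with GT and ends with AG.

    The intron is given 5'->3' on the sense strand. RNA input (U instead
    of T) is accepted and converted to DNA letters before checking.
    """
    seq = intron.strip().upper().replace("U", "T")
    # An intron needs at least the two dinucleotides themselves.
    if len(seq) < 4:
        return False
    return seq.startswith("GT") and seq.endswith("AG")


# Hypothetical toy sequences for illustration only.
canonical_intron = "GTAAGTCTTTTCCCTTGCAG"      # GT ... AG
noncanonical_intron = "GCAAGTCTTTTCCCTTGCAG"   # donor site is GC, not GT

print(follows_gt_ag_rule(canonical_intron))
print(follows_gt_ag_rule(noncanonical_intron))
```

A check of this kind only inspects the terminal dinucleotides. It says nothing about the wider donor and acceptor consensus sequences or about the splicing mechanism itself.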

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Complement C2 / genetics
  • Complement Factor B / genetics
  • DNA Restriction Enzymes
  • Endonucleases
  • Epidermal Growth Factor / genetics
  • Exons*
  • Extracellular Matrix / analysis*
  • Extracellular Matrix Proteins*
  • Glycoproteins / genetics*
  • Integrins
  • Introns*
  • Matrilin Proteins
  • Membrane Glycoproteins / genetics
  • Microscopy, Electron
  • Molecular Sequence Data
  • Promoter Regions, Genetic
  • RNA Splicing
  • Repetitive Sequences, Nucleic Acid
  • Sequence Homology, Nucleic Acid
  • Single-Strand Specific DNA and RNA Endonucleases
  • Transcription, Genetic
  • von Willebrand Factor / genetics

Substances

  • Complement C2
  • Extracellular Matrix Proteins
  • Glycoproteins
  • Integrins
  • Matrilin Proteins
  • Membrane Glycoproteins
  • von Willebrand Factor
  • Epidermal Growth Factor
  • Endonucleases
  • DNA Restriction Enzymes
  • Single-Strand Specific DNA and RNA Endonucleases
  • Complement Factor B

Associated data

  • GENBANK/X12346
  • GENBANK/X12347
  • GENBANK/X12348
  • GENBANK/X12349
  • GENBANK/X12350
  • GENBANK/X12351
  • GENBANK/X12352
  • GENBANK/X12353
  • GENBANK/X12354