Two genes, COL4A3 and COL4A4 coding for the human alpha3(IV) and alpha4(IV) collagen chains are arranged head-to-head on chromosome 2q36

FEBS Lett. 1998 Mar 6;424(1-2):11-6. doi: 10.1016/s0014-5793(98)00128-8.

Abstract

We first isolated and characterized genomic DNA fragments that cover the 5' flanking sequences of COL4A3 and COL4A4 encoding the human basement membrane alpha3(IV) and alpha4(IV) collagen chains, respectively. Nucleotide sequence analysis indicated that the two genes are arranged head-to-head. To determine transcription start site for COL4A4 gene, we performed RACE and RNase protection assays, indicating that there are two alternative transcripts presumably derived from two different promoters. Interestingly, one transcription start site (from exon 1') of COL4A4 is only 5 bp away from the reported transcription start site of COL4A3, whereas the other transcript (from exon 1) starts 373 nucleotides downstream from the first one, generating the two kinds of transcripts that differ in the 5' UTR regions. Expression of these two transcripts appears tissue-specific; exon 1 transcript was expressed predominantly in epithelial cells, while exon 1' transcript showed rather ubiquitous and low expression. The nucleotide sequence of the promoter region is composed of dense CpG dinucleotides, GC boxes, CTC boxes and a CCAAT box but no TATA box. These results provide information to delineate the promoter activity for the tissue-specific expression of the six type IV collagen genes and basement membrane assembly in different tissues and organs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Amino Acid Sequence
  • Base Sequence
  • Cell Culture Techniques
  • Chromosome Mapping
  • Chromosomes, Human, Pair 2 / genetics*
  • Collagen / genetics*
  • Humans
  • Molecular Sequence Data
  • Promoter Regions, Genetic
  • RNA, Messenger / metabolism
  • Ribonucleases
  • Transcription, Genetic

Substances

  • RNA, Messenger
  • Collagen
  • Ribonucleases

Associated data

  • GENBANK/AB008495
  • GENBANK/AB008496