Polymorphic open reading frames encoding secretory proteins are located less than 3 kilobases from Theileria parva telomeres

Mol Biochem Parasitol. 2000 Oct;110(2):359-71. doi: 10.1016/s0166-6851(00)00291-7.

Abstract

Polymorphic, multicopy gene families are frequently located in subtelomeric regions of the genomes of parasitic protozoa. Theileria parva telomere-associated (TA) DNA from two chromosomes contained long open reading frames (ORFs) 54% identical at the N-termini, whose 3' ends were 2670 and 2680 bp from the telomeric repeats. Probes derived from these ORFs revealed related sequences close to additional telomeres. The 3' end of an unrelated ORF was approximately 2720 bp from a third telomere. These are among the closest ORFs to telomeres in any organism. Reverse transcription PCR detected transcripts originating within the telomeric multicopy gene family. Additional ORFs, with complex sequence similarities, were located centromeric to the telomere-adjacent ORFs. Transcripts from the schizont stage of T. parva, containing domains with significant amino acid similarity to a 3529 codon ORF located 6900 bp upstream of the telomeric repeats, were mapped to a subtelomeric locus at a fourth telomere. Five telomeric ORFs contained predicted N-terminal signal peptides and one of these signal peptides was functional in a heterologous system. Hybridisation data suggested extensive strain polymorphism between ORFs. Two of the telomere-adjacent ORFs were absent from the genome of a cloned T. parva parasite which can, nonetheless, be passaged through ticks and cattle. T. parva is unusual, among organisms so far studied, in the high density of potential coding sequences located directly adjacent to telomeres and the apparent absence of extensive tracts of repeated sequences within the TA DNA.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Blotting, Southern
  • Cattle
  • Chromosome Mapping*
  • DNA, Complementary / genetics
  • Deoxyribonucleases, Type II Site-Specific / metabolism
  • Electrophoresis, Gel, Pulsed-Field
  • Molecular Sequence Data
  • Open Reading Frames / genetics*
  • Polymorphism, Genetic
  • Protozoan Proteins / chemistry
  • Protozoan Proteins / genetics*
  • Protozoan Proteins / metabolism
  • Sequence Analysis, DNA
  • Telomere / genetics*
  • Theileria parva / genetics*
  • Theileriasis / parasitology

Substances

  • DNA, Complementary
  • Protozoan Proteins
  • endodeoxyribonuclease SfiI
  • Deoxyribonucleases, Type II Site-Specific

Associated data

  • GENBANK/AF198437
  • GENBANK/AF198438
  • GENBANK/AF198439
  • GENBANK/AF225701