Identification and analysis of chromodomain-containing proteins encoded in the mouse transcriptome

Genome Res. 2003 Jun;13(6B):1416-29. doi: 10.1101/gr.1015703.

Abstract

The chromodomain is 40-50 amino acids in length and is conserved in a wide range of chromatic and regulatory proteins involved in chromatin remodeling. Chromodomain-containing proteins can be classified into families based on their broader characteristics, in particular the presence of other types of domains, and which correlate with different subclasses of the chromodomains themselves. Hidden Markov model (HMM)-generated profiles of different subclasses of chromodomains were used here to identify sequences encoding chromodomain-containing proteins in the mouse transcriptome and genome. A total of 36 different loci encoding proteins containing chromodomains, including 17 novel loci, were identified. Six of these loci (including three apparent pseudogenes, a novel HP1 ortholog, and two novel Msl-3 transcription factor-like proteins) are not present in the human genome, whereas the human genome contains four loci (two CDY orthologs and two apparent CDY pseudogenes) that are not present in mouse. A number of these loci exhibit alternative splicing to produce different isoforms, including 43 novel variants, some of which lack the chromodomain. The likely functions of these proteins are discussed in relation to the known functions of other chromodomain-containing proteins within the same family.

MeSH terms

  • Acetyltransferases / chemistry
  • Acetyltransferases / genetics
  • Animals
  • Ankyrins / chemistry
  • Ankyrins / genetics
  • Carrier Proteins / chemistry
  • Carrier Proteins / genetics
  • Chromobox Protein Homolog 5
  • Chromosomal Proteins, Non-Histone / chemistry
  • Chromosomal Proteins, Non-Histone / genetics
  • Chromosome Mapping / methods
  • Chromosome Mapping / statistics & numerical data
  • Computational Biology / methods
  • Computational Biology / statistics & numerical data
  • DNA Helicases / chemistry
  • DNA Helicases / genetics
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / genetics
  • Databases, Genetic / statistics & numerical data
  • Drosophila Proteins*
  • Enoyl-CoA Hydratase / chemistry
  • Enoyl-CoA Hydratase / genetics
  • Histone Acetyltransferases
  • Histone Methyltransferases
  • Histone-Lysine N-Methyltransferase*
  • Humans
  • Markov Chains
  • Methyltransferases / chemistry
  • Methyltransferases / genetics
  • Mice
  • Nuclear Proteins / chemistry
  • Nuclear Proteins / genetics
  • Polycomb-Group Proteins
  • Protein Methyltransferases
  • Protein Structure, Tertiary / genetics
  • Proteins / chemistry*
  • Proteins / genetics*
  • Proteome / chemistry
  • Proteome / genetics
  • Repressor Proteins / chemistry
  • Repressor Proteins / genetics
  • Retinoblastoma-Binding Protein 1
  • Saccharomyces cerevisiae Proteins / chemistry
  • Saccharomyces cerevisiae Proteins / genetics
  • Sequence Homology, Nucleic Acid
  • Transcription Factors / chemistry
  • Transcription Factors / genetics
  • Transcription, Genetic / genetics*

Substances

  • Ankyrins
  • Arid4a protein, mouse
  • Carrier Proteins
  • Chromosomal Proteins, Non-Histone
  • DNA-Binding Proteins
  • Drosophila Proteins
  • Nuclear Proteins
  • Polycomb-Group Proteins
  • Proteins
  • Proteome
  • Repressor Proteins
  • Retinoblastoma-Binding Protein 1
  • SMARCA2 protein, human
  • Saccharomyces cerevisiae Proteins
  • Smarca2 protein, mouse
  • Transcription Factors
  • Chromobox Protein Homolog 5
  • msl-3 protein, Drosophila
  • Histone Methyltransferases
  • Methyltransferases
  • Protein Methyltransferases
  • Histone-Lysine N-Methyltransferase
  • Acetyltransferases
  • Histone Acetyltransferases
  • SMARCA4 protein, human
  • Smarca4 protein, mouse
  • DNA Helicases
  • Enoyl-CoA Hydratase