The Release 5.1 annotation of Drosophila melanogaster heterochromatin

Science. 2007 Jun 15;316(5831):1586-91. doi: 10.1126/science.1139815.

Abstract

The repetitive DNA that constitutes most of the heterochromatic regions of metazoan genomes has hindered the comprehensive analysis of gene content and other functions. We have generated a detailed computational and manual annotation of 24 megabases of heterochromatic sequence in the Release 5 Drosophila melanogaster genome sequence. The heterochromatin contains a minimum of 230 to 254 protein-coding genes, which are conserved in other Drosophilids and more diverged species, as well as 32 pseudogenes and 13 noncoding RNAs. Improved methods revealed that more than 77% of this heterochromatin sequence, including introns and intergenic regions, is composed of fragmented and nested transposable elements and other repeated DNAs. Drosophila heterochromatin contains "islands" of highly conserved genes embedded in these "oceans" of complex repeats, which may require special expression and splicing mechanisms.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chromosome Mapping
  • Computational Biology
  • Conserved Sequence
  • DNA Transposable Elements
  • DNA, Complementary
  • Drosophila / genetics
  • Drosophila Proteins / genetics
  • Drosophila melanogaster / genetics*
  • Exons
  • Expressed Sequence Tags
  • Genes, Insect*
  • Genome
  • Heterochromatin / genetics*
  • Insecta / genetics
  • Introns
  • Pseudogenes
  • RNA, Untranslated / genetics
  • Repetitive Sequences, Nucleic Acid
  • Retroelements
  • Sequence Analysis, DNA

Substances

  • DNA Transposable Elements
  • DNA, Complementary
  • Drosophila Proteins
  • Heterochromatin
  • RNA, Untranslated
  • Retroelements