DHPC: a new tool to express genome structural features

Genomics. 2008 May;91(5):476-83. doi: 10.1016/j.ygeno.2008.01.003. Epub 2008 Mar 14.

Abstract

The DHPC (DNA Hilbert-Peano curve) is a new tool for visualizing large-scale genome sequences by mapping sequences into a two-dimensional square. It utilizes the space-filling function of Hilbert-Peano mapping. By applying a Gauss smoothing technique and a user-defined color function, a large-scale genome sequence can be mapped into a two-dimensional color image. In the calculated DHPCs, many genome characteristics are revealed. In this article we introduce the method and show how DHPCs may be used to identify regions of different base composition. The power of the method is demonstrated by presenting multiple examples such as repeating sequences, degree of base bias, regions of homogeneity and their boundaries, and mark of annotated segments. We also present several genome curves generated by DHPC to demonstrate how DHPC can be used to find previously unidentified sequence features in these genomes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Composition
  • Chickens
  • Chromosomes / chemistry
  • Chromosomes, Mammalian / chemistry
  • Dogs
  • Genome*
  • Humans
  • Internet
  • Sequence Analysis, DNA / methods*
  • Software*
  • Tetraodontiformes
  • Zebrafish