Numerical characterization and similarity analysis of DNA sequences based on 2-D graphical representation of the characteristic sequences

Comb Chem High Throughput Screen. 2003 Dec;6(8):795-9. doi: 10.2174/138620703771826900.

Abstract

Based on the classifications of the four nucleic acid bases, He and Wang reduced a DNA sequence to three binary sequences, called the characteristic sequences (J. Chem. Inf. Comput. Sci. 42 (2002) 1080). In this paper, we associate each characteristic sequence with an L/L matrix by giving a 2-D 'two horizontal lines' graphical representation, and thus obtain a 3-component vector whose entries are the sums of the maximal and minimal eigenvalues of the L/L matrices. The introduced vector yields simpler characterizations and comparisons among the coding sequences of exon 1 of the beta-globin gene of eleven different species.
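
As an illustration of the first step only, the reduction of a DNA sequence to three binary sequences might be sketched as follows. The abstract does not spell out the classifications, so this sketch assumes the three standard ones (purine/pyrimidine, amino/keto, weak/strong hydrogen bonding); the choice of which class maps to 1, and the names CLASSES and characteristic_sequences, are hypothetical. The L/L matrix and eigenvalue steps are not shown because the abstract does not define them.

```python
# Assumed base classifications (not stated in the abstract).
# Bases listed here map to 1; the other two bases map to 0.
# Which class is assigned 1 is an illustrative choice.
CLASSES = {
    "RY": frozenset("AG"),  # purines (A, G) vs pyrimidines (C, T)
    "MK": frozenset("AC"),  # amino (A, C) vs keto (G, T)
    "WS": frozenset("AT"),  # weak H-bond (A, T) vs strong (G, C)
}


def characteristic_sequences(dna):
    """Return three binary sequences, one per assumed classification."""
    dna = dna.upper()
    unknown = set(dna) - set("ACGT")
    if unknown:
        raise ValueError(f"unexpected symbols: {sorted(unknown)}")
    return {
        name: [1 if base in ones else 0 for base in dna]
        for name, ones in CLASSES.items()
    }


if __name__ == "__main__":
    for name, bits in characteristic_sequences("ATGGTG").items():
        print(name, "".join(map(str, bits)))
```

Each binary sequence would then be drawn on two horizontal lines (one line for 1s, one for 0s) before a matrix is built from the resulting curve, as the abstract describes.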

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Animals
  • Base Sequence
  • DNA / chemistry*
  • Globins / genetics
  • Humans
  • Molecular Sequence Data
  • Sequence Analysis, DNA / methods*
  • Sequence Homology, Nucleic Acid
  • Statistics as Topic / methods

Substances

  • Globins
  • DNA