Statistical physics approach to categorize biologic signals: from heart rate dynamics to DNA sequences

Chaos. 2007 Mar;17(1):015115. doi: 10.1063/1.2716147.

Abstract

We recently proposed a novel approach to categorize information carried by symbolic sequences based on their usage of repetitive patterns. A simple quantitative index to measure the dissimilarity between two symbolic sequences can be defined. This information dissimilarity index, defined by our formula, is closely related to the Shannon entropy and rank order of the repetitive patterns in the symbolic sequences. Here we discuss the underlying statistical physics assumptions of this dissimilarity index. We use human cardiac interbeat interval time series and DNA sequences as examples to illustrate the applicability of this generic approach to real-world problems.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Biophysics / methods*
  • Cluster Analysis
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Electrocardiography / methods*
  • Heart Rate / physiology*
  • Models, Biological*
  • Models, Statistical
  • Pattern Recognition, Automated / methods
  • Sequence Analysis, DNA / methods*
  • Stochastic Processes