A Review on Planted (l, d) Motif Discovery Algorithms for Medical Diagnose

Sensors (Basel). 2022 Feb 5;22(3):1204. doi: 10.3390/s22031204.

Abstract

Personalized diagnosis of chronic disease requires capturing the continual pattern across the biological sequence. This repeating pattern in medical science is called "Motif". Motifs are the short, recurring patterns of biological sequences that are supposed signify some health disorder. They identify the binding sites for transcription factors that modulate and synchronize the gene expression. These motifs are important for the analysis and interpretation of various health issues like human disease, gene function, drug design, patient's conditions, etc. Searching for these patterns is an important step in unraveling the mechanisms of gene expression properly diagnose and treat chronic disease. Thus, motif identification has a vital role in healthcare studies and attracts many researchers. Numerous approaches have been characterized for the motif discovery process. This article attempts to review and analyze fifty-four of the most frequently found motif discovery processes/algorithms from different approaches and summarizes the discussion with their strengths and weaknesses.

Keywords: evolutionary approach; hashing; local search approach; mismatch tree; probabilistic approach; search tree; suffix tree; tries.

Publication types

  • Review

MeSH terms

  • Algorithms*
  • Binding Sites
  • DNA*
  • Humans
  • Sequence Analysis, DNA
  • Transcription Factors / genetics

Substances

  • Transcription Factors
  • DNA