Computational tools for exploring sequence databases as a resource for antimicrobial peptides

Biotechnol Adv. 2017 May-Jun;35(3):337-349. doi: 10.1016/j.biotechadv.2017.02.001. Epub 2017 Feb 12.

Abstract

Data mining has been recognized by many researchers as a hot topic in different areas. In the post-genomic era, the growing number of sequences deposited in databases has been the reason why these databases have become a resource for novel biological information. In recent years, the identification of antimicrobial peptides (AMPs) in databases has gained attention. The identification of unannotated AMPs has shed some light on the distribution and evolution of AMPs and, in some cases, indicated suitable candidates for developing novel antimicrobial agents. The data mining process has been performed mainly by local alignments and/or regular expressions. Nevertheless, for the identification of distant homologous sequences, other techniques such as antimicrobial activity prediction and molecular modelling are required. In this context, this review addresses the tools and techniques, and also their limitations, for mining AMPs from databases. These methods could be helpful not only for the development of novel AMPs, but also for other kinds of proteins, at a higher level of structural genomics. Moreover, solving the problem of unannotated proteins could bring immeasurable benefits to society, especially in the case of AMPs, which could be helpful for developing novel antimicrobial agents and combating resistant bacteria.

Keywords: Antimicrobial activity prediction; Data mining; Local alignments; Molecular modelling; Profile-HMM; Regular expression; Structural genomics.

Publication types

  • Review

MeSH terms

  • Antimicrobial Cationic Peptides* / chemistry
  • Antimicrobial Cationic Peptides* / genetics
  • Antimicrobial Cationic Peptides* / metabolism
  • Computational Biology / methods*
  • Data Mining
  • Databases, Genetic*
  • Models, Molecular
  • Sequence Alignment

Substances

  • Antimicrobial Cationic Peptides