BiClusO: A Novel Biclustering Approach and Its Application to Species-VOC Relational Data

IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):1955-1965. doi: 10.1109/TCBB.2019.2914901. Epub 2020 Dec 8.

Abstract

In this paper, we propose a novel biclustering approach called BiClusO. Biclustering can be applied to various types of bipartite data such as gene-condition or gene-disease relations. For example, we applied BiClusO to bipartite relations between species and volatile organic compounds (VOCs). VOCs, which are emitted by different species, have huge environmental and ecological impacts. The biosynthesis of VOCs depends on different metabolic pathways which can be used to categorize the species. A previous study related to the KNApSAcK VOC database classified microorganisms based on their VOC profiles, which confirmed the consistency between VOC-based and pathogenicity-based classifications. However, due to limited data, classification of all species in terms of VOC profiles was not performed. In this study, we enriched our database with additional data collected from different online sources and journals. Then, by applying BiClusO to species-VOC relational data, we determined that VOC-based classification is consistent with taxonomy-based classification of the species. We also assessed the diversity of VOC pathways across different kingdoms of species.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Bacteria / metabolism
  • Cluster Analysis
  • Computational Biology / methods*
  • Data Mining
  • Databases, Chemical
  • Gastrointestinal Microbiome / physiology
  • Humans
  • Volatile Organic Compounds* / analysis
  • Volatile Organic Compounds* / classification
  • Volatile Organic Compounds* / metabolism

Substances

  • Volatile Organic Compounds