Building a bioinformatics ontology using OIL

IEEE Trans Inf Technol Biomed. 2002 Jun;6(2):135-41. doi: 10.1109/titb.2002.1006301.

Abstract

This paper describes the initial stages of building an ontology of bioinformatics and molecular biology. The conceptualization is encoded using the ontology inference layer (OIL), a knowledge representation language that combines the modeling style of frame-based systems with the expressiveness and reasoning power of description logics (DLs). This paper is the second of a pair in this special issue. The first described the core of the OIL language and the need to use ontologies to deliver semantic bioinformatics resources. In this paper, the early stages of building an ontology component of a bioinformatics resource querying application are described. This ontology (TaO) holds the information about molecular biology represented in bioinformatics resources and the bioinformatics tasks performed over these resources. It, therefore, represents the metadata of the resources the application can query. It also manages the terminologies used in constructing the query plans used to retrieve instances from those external resources. The methodology used in this task capitalizes upon features of OIL-The conceptualization afforded by the frame-based view of OIL's syntax; the expressive power and reasoning of the logical formalism; and the ability to encode both handcrafted, hierarchies of concepts, as well as defining concepts in terms of their properties, which can then be used to establish a classification and infer relationships not encoded by the ontologist. This ability forms the basis of the methodology described here: For each portion of the TaO, a basic framework of concepts is asserted by the ontologist. Then, the properties of these concepts are defined by the ontologist and the logic's reasoning power used to reclassify and infer further relationships. This cycle of elaboration and refinement is iterated on each portion of the ontology until a satisfactory ontology has been created.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence
  • Computational Biology*
  • Computer Communication Networks
  • Database Management Systems*
  • Databases, Factual / classification
  • Decision Support Techniques
  • Feasibility Studies
  • Information Storage and Retrieval / methods*
  • Internet
  • Models, Theoretical
  • Programming Languages*
  • Semantics*