Using views for retrieving data from extremely heterogeneous databanks

Pac Symp Biocomput. 1997:134-41.

Abstract

Information in molecular biology or biology in general is contained in a multitude of different databanks each specializing in a certain area. This specialization is useful since it improves the maintainability of the data and the flexibility of the overall structure which until now manages to exist without almost any standards. However, information gathering from these extremely heterogeneous sources is difficult since the desired data may be scattered over many databanks. This paper presents the concept of views implemented in the retrieval system SRS that uses links between databanks and a sophisticated parsing engine for information extraction to provide a flexible way of searching and obtaining data across databank boundaries. The implementation provides homogeneous access to all databanks using two types of views: the list-view which displays selected data-fields in their original format and the table-view available for HTML and plain ASCII text which tries to represent the data in a format independent from that of the source databank.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Biology / methods
  • Cattle
  • Databases, Factual*
  • Enzymes / chemistry
  • Humans
  • Information Storage and Retrieval*
  • Information Systems*
  • Molecular Biology / methods
  • Proteins / chemistry*
  • Software

Substances

  • Enzymes
  • Proteins