GeneRIF quality assurance as summary revision

Pac Symp Biocomput. 2007:269-80. doi: 10.1142/9789812772435_0026.

Abstract

Like the primary scientific literature, GeneRIFs exhibit both growth and obsolescence. NLM's control over the contents of the Entrez Gene database provides a mechanism for dealing with obsolete data: GeneRIFs are removed from the database when they are found to be of low quality. However, the rapid and extensive growth of Entrez Gene makes manual location of low-quality GeneRIFs problematic. This paper presents a system that takes advantage of the summary-like quality of GeneRIFs to detect low-quality GeneRIFs via a summary revision approach, achieving precision of 89% and recall of 77%. Aspects of the system have been adopted by NLM as a quality assurance mechanism.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computational Biology
  • Computer Simulation
  • Databases, Genetic / standards*
  • Models, Statistical
  • National Library of Medicine (U.S.)
  • PubMed / standards*
  • Quality Control
  • United States