Methodology used for "software for automated linkage in Italy" (SALI)

J Biomed Inform. 2001 Dec;34(6):387-95. doi: 10.1006/jbin.2002.1036.

Abstract

Linkage of epidemiological registries can provide cost-effective information on the associations between different diseases or exposures in the population under study and on completeness of surveillance system databases. We describe the program SALI (software for automated linkage in Italy) aimed at matching individual records from medium-sized registries (in the order of 100,000 records), where the desired outcome is to miss as few links as possible and, because of low link-likelihood (< 1%), a manual revision of matched pairs is feasible. SALI, developed in CA-Clipper language, uses registry files in dBase format. It requires only name, surname, and date of birth as key fields, and it allows for spelling errors in Italian or other Latin languages through a specific algorithm. Furthermore, a double-blind procedure ensures data confidentiality. The main linkage procedure is based on four stages, two automatic ones, and two where the operator can decide through specific windows whether to accept stage-selected matches. SALI takes into account possible errors in key fields thus reducing false negatives. It was used to solve the problem of linkage between AIDS and cancer registries in Italy. It can be used with every IBM-compatible computer system, assuring uniquely high portability.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acquired Immunodeficiency Syndrome
  • Algorithms
  • Databases, Factual
  • Epidemiologic Studies
  • Humans
  • Italy
  • Medical Informatics
  • Neoplasms
  • Registries*
  • Software*