Using probabilistic corrections to account for abstractor agreement in medical record reviews

Am J Epidemiol. 2007 Jun 15;165(12):1454-61. doi: 10.1093/aje/kwm034. Epub 2007 Apr 3.

Abstract

The quality of medical record abstracts is often characterized in a reliability substudy. These results usually indicate agreement, but not the extent to which lack of agreement affects associations observed in the complete data. In this study, medical records were reviewed and abstracted for patients diagnosed with stage I or stage II breast cancer between 1990 and 1994 at one of six US Cancer Research Network sites. For a subsample, interrater reliability data were available. The authors calculated conventional hazard ratios and 95% confidence intervals for the association of demographic, tumor, and treatment characteristics with recurrence rate. These conventional estimates of effect were compared with three sets of estimates and 95% simulation intervals that took account of the uncertainty assessed by lack of agreement in the reliability substudy. The rate of recurrence was associated with increasing cancer stage and with treatment modality but not with demographic characteristics. The hazard ratios and simulation intervals that took account of the reliability data showed that the simulation interval grew wider as the sources of uncertainty taken into account grew more complete, but the associations expected a priori remained readily apparent. While many investigators use reliability data only as a metric for data quality, a more thorough approach can also quantitatively depict the uncertainty in the observed associations.

Publication types

  • Multicenter Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Abstracting and Indexing*
  • Aged
  • Aged, 80 and over
  • Breast Neoplasms / epidemiology*
  • Breast Neoplasms / pathology
  • Breast Neoplasms / surgery
  • Comorbidity
  • Data Interpretation, Statistical*
  • Female
  • Humans
  • Mastectomy / methods
  • Medical Records*
  • Neoplasm Recurrence, Local / epidemiology
  • Probability*
  • United States / epidemiology