Objectives: The US Center for Disease Control and Prevention's National Death Index (NDI) is a gold standard for mortality data, yet matching patients to the database depends on accurate and available key identifiers. Our objective was to evaluate NDI data for future healthcare research studies with mortality outcomes.
Methods: We used a Kaiser Permanente Mid-Atlantic States' Virtual Data Warehouse (KPMAS-VDW) sourced from the Social Security Administration and electronic health records on members enrolled between 1 January 2005 to 31 December 2017. We submitted data to NDI on 1 036 449 members. We compared results from the NDI best match algorithm to the KPMAS-VDW for vital status and death date. We compared probabilistic scores by sex and race and ethnicity.
Results: NDI returned 372 865 (36%) unique possible matches, 663 061 (64%) records not matched to the NDI database and 522 (<1%) rejected records. The NDI algorithm resulted in 38 862 records, presumed dead, with a lower percentage of women, and Asian/Pacific Islander and Hispanic people than presumed alive. There were 27 306 presumed dead members whose death dates matched exactly between the NDI results and VDW, but 1539 did not have an exact match. There were 10 017 additional deaths from NDI results that were not present in the VDW death data.
Conclusions: NDI data can substantially improve the overall capture of deaths. However, further quality control measures were needed to ensure the accuracy of the NDI best match algorithm.
Keywords: Data Management; Healthcare; Information Systems; Medical Record Linkage; Outcome Assessment; Public Health.
© Author(s) (or their employer(s)) 2023. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.