Classification algorithms to improve the accuracy of identifying patients hospitalized with community-acquired pneumonia using administrative data

Epidemiol Infect. 2011 Sep;139(9):1296-306. doi: 10.1017/S0950268810002529. Epub 2010 Nov 19.

Abstract

In epidemiological studies of community-acquired pneumonia (CAP) that use administrative data, cases are typically defined by the presence of a pneumonia hospital discharge diagnosis code. However, not all such hospitalizations represent true CAP cases. We identified 3991 hospitalizations during 1997-2005 in a managed care organization and validated each as CAP or not by reviewing medical records. To improve the accuracy of CAP identification, we developed classification algorithms that incorporated additional administrative information associated with the hospitalization, using classification and regression tree (CART) analysis. We found that a pneumonia code designated as the primary discharge diagnosis and the duration of hospital stay improved the classification of CAP hospitalizations. Compared with the commonly used method, which is based solely on the presence of a primary discharge diagnosis code of pneumonia, these algorithms had higher sensitivity (81-98%) and positive predictive values (82-84%) with only modest decreases in specificity (48-82%) and negative predictive values (75-90%).
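
The four accuracy measures reported above all derive from a 2x2 comparison of an algorithm's classification against the medical-record review. The following is a minimal Python sketch of that calculation, not the authors' code. The function name `validation_metrics`, the `Metrics` container, and the toy data are hypothetical and chosen only for illustration; the counts are not taken from the study.

```python
from dataclasses import dataclass


@dataclass
class Metrics:
    sensitivity: float
    specificity: float
    ppv: float
    npv: float


def validation_metrics(predicted, confirmed):
    """Compare algorithm output with chart-review results.

    predicted: booleans, True if the algorithm labels the hospitalization as CAP
    confirmed: booleans, True if medical-record review confirmed CAP
    """
    pairs = list(zip(predicted, confirmed))
    tp = sum(1 for p, c in pairs if p and c)          # true positives
    fp = sum(1 for p, c in pairs if p and not c)      # false positives
    fn = sum(1 for p, c in pairs if not p and c)      # false negatives
    tn = sum(1 for p, c in pairs if not p and not c)  # true negatives
    # Assumes every denominator is non-zero; real data would need a guard.
    return Metrics(
        sensitivity=tp / (tp + fn),
        specificity=tn / (tn + fp),
        ppv=tp / (tp + fp),
        npv=tn / (tn + fn),
    )


# Hypothetical toy data (8 hospitalizations), for illustration only.
predicted = [True, True, True, False, False, True, False, True]
confirmed = [True, True, False, False, True, True, False, True]
m = validation_metrics(predicted, confirmed)
print(m)
```

In this toy example there are 4 true positives, 1 false positive, 1 false negative, and 2 true negatives, so sensitivity and positive predictive value are both 4/5 and specificity and negative predictive value are both 2/3.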

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Algorithms
  • Child
  • Child, Preschool
  • Community-Acquired Infections / classification*
  • Community-Acquired Infections / diagnosis
  • Community-Acquired Infections / epidemiology*
  • Female
  • Hospitalization / statistics & numerical data
  • Humans
  • Infant
  • Infant, Newborn
  • Male
  • Middle Aged
  • Pneumonia / classification*
  • Pneumonia / diagnosis
  • Pneumonia / epidemiology*
  • Predictive Value of Tests
  • Sensitivity and Specificity