Adjusting for COPD severity in database research: developing and validating an algorithm

Int J Chron Obstruct Pulmon Dis. 2011:6:669-78. doi: 10.2147/COPD.S26214. Epub 2011 Dec 6.

Abstract

Purpose: When comparing chronic obstructive lung disease (COPD) interventions in database research, it is important to adjust for severity. Global Initiative for Chronic Obstructive Lung Disease (GOLD) guidelines grade severity according to lung function. Most databases lack data on lung function. Previous database research has approximated COPD severity using demographics and healthcare utilization. This study aims to derive an algorithm for COPD severity using baseline data from a large respiratory trial (UPLIFT).

Methods: Partial proportional odds logit models were developed for probabilities of being in GOLD stages II, III and IV. Concordance between predicted and observed stage was assessed using kappa-statistics. Models were estimated in a random selection of 2/3 of patients and validated in the remainder. The analysis was repeated in a subsample with a balanced distribution across severity stages. Univariate associations of COPD severity with the covariates were tested as well.

Results: More severe COPD was associated with being male and younger, having quit smoking, lower BMI, osteoporosis, hospitalizations, using certain medications, and oxygen. After adjusting for these variables, co-morbidities, previous healthcare resource use (eg, emergency room, hospitalizations) and inhaled corticosteroids, xanthines, or mucolytics were no longer independently associated with COPD severity, although they were in univariate tests. The concordance was poor (kappa = 0.151) and only slightly better in the balanced sample (kappa = 0.215).

Conclusion: COPD severity cannot be reliably predicted from demographics and healthcare use. This limitation should be considered when interpreting findings from database studies, and additional research should explore other methods to account for COPD severity.

Keywords: GOLD; healthcare resource use; partial proportional odds logit.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Age Factors
  • Aged
  • Algorithms*
  • Australia / epidemiology
  • Body Mass Index
  • Data Mining / methods*
  • Databases, Factual*
  • Europe / epidemiology
  • Female
  • Health Resources / statistics & numerical data
  • Humans
  • Logistic Models
  • Lung / physiopathology*
  • Male
  • Middle Aged
  • Models, Statistical*
  • Multicenter Studies as Topic
  • Multivariate Analysis
  • New Zealand / epidemiology
  • Osteoporosis / epidemiology
  • Predictive Value of Tests
  • Pulmonary Disease, Chronic Obstructive / diagnosis*
  • Pulmonary Disease, Chronic Obstructive / physiopathology
  • Pulmonary Disease, Chronic Obstructive / therapy
  • Randomized Controlled Trials as Topic
  • Reproducibility of Results
  • Respiratory Function Tests*
  • Severity of Illness Index
  • Sex Factors
  • Smoking / epidemiology
  • Treatment Outcome
  • United States / epidemiology