Validity of administrative data for the diagnosis of primary sclerosing cholangitis: a population-based study

Natalie A Molodecky; Robert P Myers; Herman W Barkema; Hude Quan; Gilaad G Kaplan

doi:10.1111/j.1478-3231.2011.02484.x

Validity of administrative data for the diagnosis of primary sclerosing cholangitis: a population-based study

Liver Int. 2011 May;31(5):712-20. doi: 10.1111/j.1478-3231.2011.02484.x. Epub 2011 Mar 9.

Authors

Natalie A Molodecky¹, Robert P Myers, Herman W Barkema, Hude Quan, Gilaad G Kaplan

Affiliation

¹ Division of Gastroenterology, University of Calgary, Calgary, AB, Canada.

PMID: 21457444
DOI: 10.1111/j.1478-3231.2011.02484.x

Abstract

Background/aims: Administrative databases could be useful in studying the epidemiology of primary sclerosing cholangitis (PSC); however, there is no information regarding the validity of the diagnostic code in administrative databases. The aims of this study were to determine the validity of administrative data for a diagnosis of PSC and generate algorithms for the identification of PSC patients.

Methods: The sensitivity (Se) and positive predictive value (PPV) of a PSC diagnosis based on administrative data from 2000 to 2003 were determined through chart review data. Algorithms were developed by considering variables associated with PSC and coding details. A logistic regression model was constructed using covariates associated with PSC. Based on this model, each subject was assigned a probability of having PSC. A cutoff value was selected that maximized the Se and specificity (Sp) of correctly predicting PSC cases.

Results: In the administrative data, the initial Se and PPV were 83.7 and 7.2% respectively. The optimal algorithm included one PSC code and one inflammatory bowel disease code and had Se 56% and PPV 59%. Overall, the algorithms yielded inadequate PPV and Se estimates to identify a cohort of true PSC cases. The predictive model was constructed using six covariates. For this model, the area under the receiver operating characteristic curve was 93.5%. A cutoff of 0.0729 was used, which maximized the Se 81.9% and Sp 90.7%; however, the PPV was 41.0%.

Conclusion: An algorithm for the identification of true PSC cases from administrative data was not possible. We recommend that PSC receives a distinct ICD code from ascending cholangitis.

Publication types

Research Support, Non-U.S. Gov't
Validation Study

MeSH terms

Adult
Aged
Alberta / epidemiology
Algorithms
Biopsy
Cholangitis, Sclerosing* / classification
Cholangitis, Sclerosing* / diagnosis
Cholangitis, Sclerosing* / epidemiology
Data Mining*
Databases as Topic
Female
Humans
International Classification of Diseases*
Logistic Models
Male
Middle Aged
Predictive Value of Tests
ROC Curve
Reproducibility of Results
Time Factors

Grants and funding

Canadian Institutes of Health Research/Canada