Background: To evaluate screening and treatment strategies, large-scale real-world data on liver disease-related outcomes are needed. We sought to validate health administrative data for identification of cirrhosis, decompensated cirrhosis and hepatocellular carcinoma among patients with known liver disease.
Methods: Primary patient data were abstracted from patients of the Toronto Center for Liver Disease with viral hepatitis (2006-2014), and all patients with liver disease from the Kingston Health Sciences Centre Hepatology Clinic (2013). We linked clinical information to health administrative data and tested a range of coding algorithms against the clinical reference standard.
Results: A total of 6,714 patients had primary chart data abstracted. A single physician visit code for cirrhosis was sensitive (98-99%), and a single hospital diagnostic code for cirrhosis was specific (91-96%). The most sensitive algorithm for decompensated cirrhosis was one cirrhosis code with any of: a hospital diagnostic code, death code, or procedure code for decompensation (range 88-99% across groups). The most specific was one cirrhosis code and one hospital diagnostic code (range 89-98% across groups). Two physician visit codes or a single hospital diagnostic code, death code, or procedure code combined with a code for cirrhosis were sensitive and specific for hepatocellular carcinoma (sensitivity 94-96%, specificity 93-98%).
Conclusion: These sensitive and specific algorithms can be used to define patient cohorts or detect clinical outcomes using health administrative data. Our results will facilitate research into the adequacy of screening and treatment for patients with chronic viral hepatitis or other liver diseases.