Flagging unusual clusters based on linear mixed models using weighted and self-calibrated predictors

Charles E McCulloch; John M Neuhaus; Ross D Boylan

doi:10.1093/biomtc/ujae022

Biometrics. 2024 Mar 27;80(2):ujae022. doi: 10.1093/biomtc/ujae022.

Authors

Charles E McCulloch¹, John M Neuhaus¹, Ross D Boylan¹

Affiliation

¹ Division of Biostatistics, Department of Epidemiology and Biostatistics, University of California, San Francisco 94158, United States.

PMID: 38563530

Abstract

Statistical models incorporating cluster-specific intercepts are commonly used in hierarchical settings, for example, observations clustered within patients or patients clustered within hospitals. Predicted values of these intercepts are often used to identify or "flag" extreme or outlying clusters, such as poorly performing hospitals or patients with rapid declines in their health. We consider a variety of flagging rules, assessing different predictors and using different accuracy measures. Using theoretical calculations and comprehensive numerical evaluation, we show that previously proposed rules based on the two most commonly used predictors, the usual best linear unbiased predictor and the fixed effects predictor, perform extremely poorly: the incorrect flagging rates are either unacceptably high (approaching 0.5 in the limit) or so low that the rules are overly conservative (e.g., much less than 0.05 for reasonable parameter values, leading to very low correct flagging rates). We develop novel methods for flagging extreme clusters that can control the incorrect flagging rates, including very simple-to-use versions that we call "self-calibrated." The new methods have substantially higher correct flagging rates than previously proposed methods while controlling the incorrect flagging rates. We illustrate their application using data on length of stay in pediatric hospitals for children admitted for asthma diagnoses.
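To make the flagging setup concrete, the following is a minimal simulation sketch and not the authors' procedure. It assumes a balanced one-way random-intercept model with a known overall mean of zero and known variance components, flags a cluster when its predicted intercept exceeds the 90th percentile of the random-intercept distribution, and defines the incorrect and correct flagging rates as the proportions flagged among clusters whose true intercept is below or above that cutoff. The function name, parameter values, cutoff, and rate definitions are all illustrative assumptions and may differ from those used in the paper.

```python
import numpy as np


def simulate_flagging(n_clusters=20000, n_per=5, sigma_b=1.0, sigma_e=2.0,
                      z=1.2816, seed=0):
    """Compare two naive flagging rules in a one-way random-intercept model.

    Assumed model (illustrative only): y_ij = b_i + e_ij, with
    b_i ~ N(0, sigma_b^2), e_ij ~ N(0, sigma_e^2), n_per observations per
    cluster, and all parameters treated as known.
    """
    rng = np.random.default_rng(seed)

    # True cluster intercepts.
    b = rng.normal(0.0, sigma_b, n_clusters)

    # The cluster mean equals the true intercept plus averaged noise.
    ybar = b + rng.normal(0.0, sigma_e / np.sqrt(n_per), n_clusters)

    # Shrinkage factor of the best linear unbiased predictor (BLUP).
    shrink = sigma_b**2 / (sigma_b**2 + sigma_e**2 / n_per)

    # Assumed cutoff: about the 90th percentile of N(0, sigma_b^2).
    cutoff = z * sigma_b
    truly_extreme = b > cutoff

    predictors = {"fixed_effects": ybar, "blup": shrink * ybar}
    results = {}
    for name, pred in predictors.items():
        flagged = pred > cutoff
        results[name] = {
            # Share flagged among clusters that are not truly extreme.
            "incorrect": float(flagged[~truly_extreme].mean()),
            # Share flagged among clusters that are truly extreme.
            "correct": float(flagged[truly_extreme].mean()),
        }
    return results


if __name__ == "__main__":
    for name, rates in simulate_flagging().items():
        print(name, rates)
```

Because the shrinkage factor is below 1 and the cutoff is positive, every cluster flagged by the shrunken predictor in this sketch is also flagged by the unshrunken cluster mean, so both of its rates can only be equal or lower. The sketch does not implement the weighted or self-calibrated predictors described in the abstract.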

Keywords: hierarchical model; predicted random effects; profiling; weighted prediction.

MeSH terms

Asthma* / diagnosis
Child
Hospitalization
Humans
Linear Models
Models, Statistical*