Screening for data clustering in multicenter studies: the residual intraclass correlation

BMC Med Res Methodol. 2013 Oct 23:13:128. doi: 10.1186/1471-2288-13-128.

Abstract

Background: In multicenter studies, center-specific variations in measurements may arise for various reasons, such as low interrater reliability, differences in equipment, deviations from the protocol, sociocultural characteristics, and differences in patient populations due to e.g. local referral patterns. The aim of this research is to derive measures for the degree of clustering. We present a method to detect heavily clustered variables and to identify physicians with outlying measurements.

Methods: We use regression models with fixed effects to account for patient case-mix and a random cluster intercept to study clustering by physicians. We propose to use the residual intraclass correlation (RICC), the proportion of residual variance that is situated at the cluster level, to detect variables that are influenced by clustering. An RICC of 0 indicates that the variance in the measurements is not due to variation between clusters. We further suggest, where appropriate, to evaluate RICC in combination with R2, the proportion of variance that is explained by the fixed effects. Variables with a high R2 may have benefits that outweigh the disadvantages of clustering in terms of statistical analysis. We apply the proposed methods to a dataset collected for the development of models for ovarian tumor diagnosis. We study the variability in 18 tumor characteristics collected through ultrasound examination, 4 patient characteristics, and the serum marker CA-125 measured by 40 physicians on 2407 patients.

Results: The RICC showed large variation between variables: from 2.2% for age to 25.1% for the amount of fluid in the pouch of Douglas. Seven variables had an RICC above 15%, indicating that a considerable part of the variance is due to systematic differences at the physician level, rather than random differences at the patient level. Accounting for differences in ultrasound machine quality reduced the RICC for a number of blood flow measurements.

Conclusions: We recommend that the degree of data clustering is addressed during the monitoring and analysis of multicenter studies. The RICC is a useful tool that expresses the degree of clustering as a percentage. Specific applications are data quality monitoring and variable screening prior to the development of a prediction model.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • CA-125 Antigen / blood
  • Cluster Analysis
  • Data Interpretation, Statistical
  • Female
  • Humans
  • Models, Statistical
  • Multicenter Studies as Topic / methods*
  • Observer Variation
  • Ovarian Neoplasms / blood
  • Ovarian Neoplasms / diagnostic imaging
  • Regression Analysis
  • Reproducibility of Results
  • Ultrasonography / standards

Substances

  • CA-125 Antigen