The Impact of Item Misspecification and Dichotomization on Class and Parameter Recovery in LCA of Count Data

Multivariate Behav Res. 2019 Jan-Feb;54(1):113-145. doi: 10.1080/00273171.2018.1499499. Epub 2018 Dec 31.

Abstract

Mixture analysis of count data has become increasingly popular among researchers of substance use, behavioral analysis, and program evaluation. However, this increase in popularity seems to have occurred along with adoption of some conventions in model specification based on arbitrary heuristics that may impact the validity of results. Findings from a systematic review of recent drug and alcohol publications suggested count variables are often dichotomized or misspecified as continuous normal indicators in mixture analysis. Prior research suggests that misspecifying skewed distributions of continuous indicators in mixture analysis introduces bias, though the consequences of this practice when applied to count indicators has not been studied. The present work describes results from a simulation study examining bias in mixture recovery when count indicators are dichotomized (median split; presence vs. absence), ordinalized, or the distribution is misspecified (continuous normal; incorrect count distribution). All distributional misspecifications and methods of categorizing resulted in greater bias in parameter estimates and recovery of class membership relative to specifying the true distribution, though dichotomization appeared to improve class enumeration accuracy relative to all other specifications. Overall, results demonstrate the importance of accurately modeling count indicators in mixture analysis, as misspecification and categorizing data can distort study outcomes.

Keywords: Mixture modeling; Monte Carlo studies; class recovery; count data; dichotomization; latent class analysis; median split.

Publication types

  • Review

MeSH terms

  • Computer Simulation
  • Data Interpretation, Statistical*
  • Factor Analysis, Statistical
  • Latent Class Analysis*
  • Models, Statistical
  • Monte Carlo Method