Interpretation and identification of within-unit and cross-sectional variation in panel data models

Jonathan Kropko; Robert Kubinec

doi:10.1371/journal.pone.0231349

Interpretation and identification of within-unit and cross-sectional variation in panel data models

PLoS One. 2020 Apr 21;15(4):e0231349. doi: 10.1371/journal.pone.0231349. eCollection 2020.

Authors

Jonathan Kropko¹, Robert Kubinec²

Affiliations

¹ School of Data Science, University of Virginia, Charlottesville, Virginia, United States of America.
² Division of Social Sciences, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates.

Abstract

While fixed effects (FE) models are often employed to address potential omitted variables, we argue that these models' real utility is in isolating a particular dimension of variance from panel data for analysis. In addition, we show through novel mathematical decomposition and simulation that only one-way FE models cleanly capture either the over-time or cross-sectional dimensions in panel data, while the two-way FE model unhelpfully combines within-unit and cross-sectional variation in a way that produces un-interpretable answers. In fact, as we show in this paper, if we begin with the interpretation that many researchers wrongly assign to the two-way FE model-that it represents a single estimate of X on Y while accounting for unit-level heterogeneity and time shocks-the two-way FE specification is statistically unidentified, a fact that statistical software packages like R and Stata obscure through internal matrix processing.

MeSH terms

Data Interpretation, Statistical
Databases, Factual
Models, Statistical*

Grants and funding

The authors received no specific funding for this work.