Confidence intervals for multinomial logistic regression in sparse data

Shelley B Bull; Juan Pablo Lewinger; Sophia S F Lee

doi:10.1002/sim.2518

Stat Med. 2007 Feb 20;26(4):903-18. doi: 10.1002/sim.2518.

Authors

Shelley B Bull¹, Juan Pablo Lewinger, Sophia S F Lee

Affiliation

¹ Samuel Lunenfeld Research Institute, Prosserman Centre for Health Research, Mount Sinai Hospital, Toronto, Ont., Canada M5G 1X5.

PMID: 16489602

Abstract

Logistic regression is one of the most widely used regression models in practice, but alternatives to conventional maximum likelihood estimation methods may be more appropriate for small or sparse samples. Modification of the logistic regression score function to remove first-order bias is equivalent to penalizing the likelihood by the Jeffreys prior, and yields penalized maximum likelihood estimates (PLEs) that always exist, even in samples in which maximum likelihood estimates (MLEs) are infinite. PLEs are an attractive alternative in small-to-moderate-sized samples, and are preferred to exact conditional MLEs when there are continuous covariates. We present methods to construct confidence intervals (CIs) in the penalized multinomial logistic regression model, and compare CI coverage and length for the PLE-based methods with those of conventional MLE-based methods in trinomial logistic regressions with both binary and continuous covariates. Based on simulation studies in sparse data sets, we recommend profile CIs over asymptotic Wald-type intervals for the PLEs in all cases. Furthermore, when finite sample bias and data separation are likely to occur, we prefer PLE profile CIs over MLE methods.
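
To illustrate the claim that PLEs exist even when MLEs are infinite, the following is a minimal sketch of Jeffreys-prior (Firth-type) penalized estimation. It is not the authors' implementation, and it assumes the binary logistic model rather than the multinomial model studied in the paper, purely to keep the example short. The function name `firth_logistic`, the step cap, and the toy data are all hypothetical choices made for illustration. The modified score used is X'(y - p + h(0.5 - p)), where h holds the diagonal elements of the weighted hat matrix.

```python
import numpy as np


def firth_logistic(X, y, max_iter=200, tol=1e-8, max_step=5.0):
    """Binary logistic regression penalized by the Jeffreys prior.

    Hypothetical helper for illustration only (binary case, not the
    multinomial model of the paper). Solves the modified score equation
    X'(y - p + h * (0.5 - p)) = 0 by Fisher-scoring iterations.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    beta = np.zeros(X.shape[1])
    for _ in range(max_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))      # fitted probabilities
        w = p * (1.0 - p)                        # binomial variance weights
        info = X.T @ (w[:, None] * X)            # Fisher information X'WX
        info_inv = np.linalg.inv(info)
        # Leverages h_i = w_i * x_i' (X'WX)^{-1} x_i
        h = w * np.einsum("ij,jk,ik->i", X, info_inv, X)
        score = X.T @ (y - p + h * (0.5 - p))    # bias-reducing modified score
        step = info_inv @ score
        largest = np.max(np.abs(step))
        if largest > max_step:                   # assumed safeguard: cap the step size
            step *= max_step / largest
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


# Toy data with complete separation: y = 1 exactly when x > 0,
# so the ordinary MLE of the slope is infinite.
x = np.array([-2.0, -1.0, 1.0, 2.0])
y = np.array([0.0, 0.0, 1.0, 1.0])
X = np.column_stack([np.ones_like(x), x])        # intercept + covariate

beta_hat = firth_logistic(X, y)
print("penalized estimates (intercept, slope):", beta_hat)
```

On this separated toy data set the penalized slope estimate is finite and positive, whereas an unpenalized fit would diverge. Interval estimation is not shown here; the abstract recommends profile CIs over Wald-type intervals for the PLEs.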

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Aspartate Aminotransferases / blood
Computer Simulation
Confidence Intervals*
Glutamate Dehydrogenase / chemistry
Hepatitis, Chronic / diagnosis
Hepatitis, Viral, Human / diagnosis
Humans
Likelihood Functions*
Logistic Models*
Transfusion Reaction

Substances

Glutamate Dehydrogenase
Aspartate Aminotransferases

Grants and funding

55118-1/Canadian Institutes of Health Research/Canada