Estimating treatment effects in observational studies with both prevalent and incident cohorts

Can J Stat. 2017 Jun;45(2):202-219. doi: 10.1002/cjs.11317. Epub 2017 Apr 13.

Abstract

Registry databases are increasingly being used for comparative effectiveness research in cancer. Such databases reflect the real-world patient population and physician practice, and thus are natural sources for comparing multiple treatment scenarios and the associated long-term clinical outcomes. Registry databases usually include both incident and prevalent cohorts, which provide valuable complementary information for patients with more recent diagnoses in the incident cohort as well as patients with long-term follow-up data in the prevalent cohort. However, utilizing such data to derive valid inference poses two major challenges: the data from a prevalent cohort are not random samples of the target population, and there may be substantial differences in the baseline characteristics of patients between treatment arms, which influences the decisions about treatment selection in both cohorts. In this article, we extend propensity score methodology to observational studies that involve both prevalent and incident cohorts, and assess the effectiveness of radiation therapy in SEER-Medicare patients diagnosed with stage IV breast cancer. Specifically, we utilize the incident cohort to estimate the propensity for receiving radiation therapy, and then combine data from both the incident and prevalent cohorts to estimate the effect of radiation therapy by adjusting for the propensity scores in the model. We evaluate the proposed method with simulations. We demonstrate that the proposed propensity score method simultaneously removes sampling bias and selection bias under several assumptions.

Keywords: Incident cohort; Prevalent cohort; Propensity score; Sampling bias; Selection bias.