License: CC BY 4.0
arXiv:2312.04150v1 [stat.ME] 07 Dec 2023
\copyyear

2023 \startpage1 \authormarkTang et al. \titlemarkBounds for the average treatment effect \corresSatoshi Hattori, Department of Biomedical Statistics, Graduate School of Medicine, Osaka University, Osaka, Japan.

A simple sensitivity analysis method for unmeasured confounders via linear programming with estimating equation constraints

Chengyao Tang    Yi Zhou    Ao Huang    Satoshi Hattori \orgdivDepartment of Biomedical Statistics, Graduate School of Medicine, \orgnameOsaka University, \orgaddress\stateOsaka, \countryJapan \orgdivBeijing International Center for Mathematical Research, \orgnamePeking University,\orgaddress\stateBeijing, \countryChina \orgdivDepartment of Medical Statistics, \orgnameUniversity Medical Center Göttingen, \orgaddress\stateGöttingen, \countryGermany \orgdivIntegrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives (OTRI), \orgnameOsaka University, \orgaddress\stateOsaka, \countryJapan [email protected]
Abstract

[Abstract]In estimating the average treatment effect in observational studies, the influence of confounders should be appropriately addressed. To this end, the propensity score is widely used. If the propensity scores are known for all the subjects, bias due to confounders can be adjusted by using the inverse probability weighting (IPW) by the propensity score. Since the propensity score is unknown in general, it is usually estimated by the parametric logistic regression model with unknown parameters estimated by solving the score equation under the strongly ignorable treatment assignment (SITA) assumption. Violation of the SITA assumption and/or misspecification of the propensity score model can cause serious bias in estimating the average treatment effect. To relax the SITA assumption, the IPW estimator based on the outcome-dependent propensity score has been successfully introduced. However, it still depends on the correctly specified parametric model and its identification. In this paper, we propose a simple sensitivity analysis method for unmeasured confounders. In the standard practice, the estimating equation is used to estimate the unknown parameters in the parametric propensity score model. Our idea is to make inference on the average causal effect by removing restrictive parametric model assumptions while still utilizing the estimating equation. Using estimating equations as constraints, which the true propensity scores asymptotically satisfy, we construct the worst-case bounds for the average treatment effect with linear programming. Different from the existing sensitivity analysis methods, we construct the worst-case bounds with minimal assumptions. We illustrate our proposal by simulation studies and a real-world example.

keywords:
Unmeasured confounders, sensitivity analysis, average treatment effect, linear programming
articletype: Article Type

1 Introduction

In observational studies, it is always crucial to adjust influence of confounders in estimating the average treatment effect (ATE). If all the confounders are observed and satisfy the strongly ignorable treatment assignment (SITA) assumption,1, 2 one can adjust the effects of confounders by using the propensity score. With the propensity score, inverse probability weighting (IPW)3, 4 is a popular approach. The IPW method constructs weights on the observations of each subject, and then the ATE can be identified by comparing the weighted outcomes of two groups.5 In practice, the propensity score is unknown. Then, the estimation of the propensity score usually relies on a parametric model such as the logistic regression under the SITA assumption. In most observational studies, it is untestable and implausible that there are no unmeasured confounders, and then the SITA assumption may fail to hold. Using the outcome-dependent propensity score is an option to make inference without the SITA assumption.6, 7 By incorporating the outcome variable in the model of the propensity score, we can make inference on the ATE without the SITA assumption. In general, the outcome-dependent propensity score is estimated by a parametric logistic regression model with the observed confounders and the outcome as explanatory variables. Thus, model misspecification is still of concern in the estimation of the outcome-dependent propensity score. Moreover, it has an unidentifiability issue;8, 9 that is, the estimating equation cannot determine the unknown parameters uniquely in the outcome-dependent propensity score. Then, the outcome-dependent propensity score cannot solve the issue of unmeasured confounders completely.

Sensitivity analysis is a useful tool to assess the potential impact of unmeasured confounders, and many sensitivity analysis methods have been developed. With the substantially increasing applications of the propensity score methods in the analysis of observational studies, there is a growing interest in employing sensitivity analysis methods in real-data analyses. Typical sensitivity analysis approaches involve formulating additional assumptions with regards to the relationships among unmeasured confounders, treatment assignments, and outcomes. These assumptions often take the form of plausible values for parameters that cannot be directly estimated from the observed data and must be set by analysts. Rosenbaum and Rubin 10 and Lin et al. 11 modeled the mechanism of confounding with both the measured and unmeasured confounders and then estimated the treatment effect parameter of interest. Alternatively, Cornfield et al. 12 and Ding and Vanderweele 13 developed methods to construct the bounds for the treatment effects to quantify the magnitude of the unmeasured confounders. These bounds were designed to elucidate the extent to which unmeasured confounders could influence observed causal estimates. Particularly, when the sensitivity parameters were expressed as risk ratios, the E-value 14 was introduced and has become a pivotal quantity in the realm of causal inference in observational studies. While the E-value can provide a bound without any model specification, the estimand is restrictive and the bound is likely to be wide, which can lead to inefficiency in sensitivity analysis.

For the sensitivity analysis approaches based on the IPW method to estimate the ATE, Li et al. 15 modeled the mean between-group differences of potential outcomes to correct bias in the presence of unmeasured confounders. Shen et al.16 proposed an IPW-based sensitivity analysis method by using two parameters, the variance of the multiplicative errors in the estimated propensity score and its correlations with the potential outcomes, to quantify the bias due to unmeasured confounders. Lu and Ding 17 extended the method of Li et al.15 into a more flexible sensitivity analysis framework, which can handle the IPW, outcome regression, and doubly robust estimators. In addition, Zhao et al. 18 constructed bounds for the ATE based on the IPW estimators by incorporating a marginal sensitivity model.19 Dorn and Guo 20 further refined this method and gave sharper bounds. These sensitivity analysis methods can address the impacts of violation of the SITA assumption by quantifying potential biases; however, they rely on untestable parametric assumptions on the departure from the SITA assumption, and it is practically difficult to set a relevant magnitude of the departure.

In this paper, a simple sensitivity analysis framework for unmeasured confounders is proposed. In the standard process of the confounder adjustment with the outcome-dependent propensity score, a parametric model for the propensity score is assumed, and an estimating equation is introduced to estimate its unknown parameters. Instead of determining a unique model for the outcome-dependent propensity score, we construct bounds for the ATE by considering possible propensity scores. We realize it by removing the parametric model for the propensity score, but still relying on the estimating equation. We introduce an optimization problem constrained by the estimating equation, which the true propensity score asymptotically satisfies. The worst-case bounds for the ATE can be obtained by solving a linear programming problem. Different from the existing sensitivity analysis methods, the proposed worst-case bounds do not rely on strong assumptions. By increasing the dimension of the estimating equations involving many covariates, one can make the bounds further narrow. Compared with existing sensitivity analysis methods, the proposed method offers the following advantages. First, the proposed method can provide worst-case bounds with minimal assumptions. Second, since the proposed method is free from the estimated propensity score under the SITA assumption, its misspecification does not matter. Finally, our method exhibits computational efficiency as the optimization problem can be solved by linear programming.

The rest of this paper is organized as follows. In Section 2, we introduce the basic notations and the standard methods with the parametric propensity score. In Section 3, some existing sensitivity analysis methods for the IPW estimator are reviewed. In Section 4, the proposed method for sensitivity analysis is introduced. We investigate the performance of the proposed method on simulated datasets in Section 5, and illustrate it on a real-world example in Section 6. In Section 7, we provide a concluding discussion to summarize the main findings and contributions of this paper.

2 Estimation with the parametric propensity score

2.1 Notations and the standard propensity score analysis

In this paper, we consider to estimate the ATE for the overall mean over the population in an observational study with two treatment groups. Let Z𝑍Zitalic_Z be the treatment assignment: Z=1𝑍1Z=1italic_Z = 1 if the subject is in the treated (exposed) group and Z=0𝑍0Z=0italic_Z = 0 if in the control group. Let X𝑋Xitalic_X be a vector of baseline covariates and Y𝑌Yitalic_Y be the observed outcome. We follow Rubin’s causal model framework.21 Let Y(1)superscript𝑌1Y^{(1)}italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT and Y(0)superscript𝑌0Y^{(0)}italic_Y start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT be the potential outcomes if the subjects were assigned to the treated group (Z=1𝑍1Z=1italic_Z = 1) and the control group (Z=0𝑍0Z=0italic_Z = 0), respectively. Suppose the observational study enrolls n𝑛nitalic_n subjects, and the observed data (Yi,Zi,Xi)subscript𝑌𝑖subscript𝑍𝑖subscript𝑋𝑖(Y_{i},Z_{i},X_{i})( italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) for subject i𝑖iitalic_i (i=1,2,,n𝑖12𝑛i=1,2,\dots,nitalic_i = 1 , 2 , … , italic_n) available, which are independent and identically distributed copies of (Y,Z,X)𝑌𝑍𝑋(Y,Z,X)( italic_Y , italic_Z , italic_X ). Denote μ1=E[Y(1)]subscript𝜇1𝐸delimited-[]superscript𝑌1\mu_{1}=E[Y^{(1)}]italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_E [ italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ] and μ0=E[Y(0)]subscript𝜇0𝐸delimited-[]superscript𝑌0\mu_{0}=E[Y^{(0)}]italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_E [ italic_Y start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ]. The ATE, which is of our primary interest to estimate, is defined by

ψ=μ1μ0=E[Y(1)]E[Y(0)].𝜓subscript𝜇1subscript𝜇0𝐸delimited-[]superscript𝑌1𝐸delimited-[]superscript𝑌0\psi=\mu_{1}-\mu_{0}=E[Y^{(1)}]-E[Y^{(0)}].italic_ψ = italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_E [ italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ] - italic_E [ italic_Y start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ] .

In observational studies, owing to the absence of randomization, the potential influence of confounders should be carefully handled in estimating the ATE. The propensity score is widely used to adjust the bias due to confounding. The propensity score is defined by e(Xi)=p(Zi=1Xi)𝑒subscript𝑋𝑖𝑝subscript𝑍𝑖conditional1subscript𝑋𝑖e(X_{i})=p(Z_{i}=1\mid X_{i})italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_p ( italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 ∣ italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). Various methods, such as stratification, matching, and IPW,3, 4, 22 can be employed to adjust for confounding with the propensity score. The standard propensity analysis is conducted under the following assumptions: {assumption} Consistency: Yi=ZiYi(1)+(1Zi)Yi(0)subscript𝑌𝑖subscript𝑍𝑖superscriptsubscript𝑌𝑖11subscript𝑍𝑖superscriptsubscript𝑌𝑖0Y_{i}=Z_{i}Y_{i}^{(1)}+(1-Z_{i})Y_{i}^{(0)}italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT + ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT . {assumption} Positivity: there exists a small positive parameter δ𝛿\deltaitalic_δ such that 0<δe(Xi)1δ0𝛿𝑒subscript𝑋𝑖1𝛿0<\delta\leq e(X_{i})\leq 1-\delta0 < italic_δ ≤ italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ 1 - italic_δ, for all subjects i𝑖iitalic_i. {assumption} SITA: (Yi(1),Yi(0))ZiXi(Y^{(1)}_{i},Y^{(0)}_{i})\perp\!\!\!\perp Z_{i}\mid X_{i}( italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⟂ ⟂ italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∣ italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Assumption 3 implies that the bias due to confounding can be adjusted by using X𝑋Xitalic_X in principle. The SITA assumption is corresponding to the Missing At Random (MAR) in the missing data analysis context. In this paper, we handle situations in which the SITA is violated, which is corresponding to the concept of the Missing Not At Random (MNAR) in the missing data problem. We use the terminologies SITA and MAR exchangeably. In practice, the propensity score is unknown, and then some parametric models such as the logistic regression model is usually assumed. Let logit(e(Xi;θ,α))=θ+αXilogit𝑒subscript𝑋𝑖𝜃𝛼𝜃superscript𝛼topsubscript𝑋𝑖\mbox{logit}(e(X_{i};\theta,\alpha))=\theta+\alpha^{\top}X_{i}logit ( italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_θ , italic_α ) ) = italic_θ + italic_α start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The unknown parameters are usually estimated by solving the following score equation:

i=1n(1Xi)(Ziexp(θ+αXi)1+exp(θ+αXi))=0.superscriptsubscript𝑖1𝑛1missing-subexpressionsubscript𝑋𝑖missing-subexpressionsubscript𝑍𝑖𝑒𝑥𝑝𝜃superscript𝛼topsubscript𝑋𝑖1𝑒𝑥𝑝𝜃superscript𝛼topsubscript𝑋𝑖0\sum_{i=1}^{n}\left(\begin{array}[]{cc}1\\ X_{i}\end{array}\right)\left(Z_{i}-\frac{exp(\theta+\alpha^{\top}X_{i})}{1+exp% (\theta+\alpha^{\top}X_{i})}\right)=0.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL end_CELL end_ROW end_ARRAY ) ( italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - divide start_ARG italic_e italic_x italic_p ( italic_θ + italic_α start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG 1 + italic_e italic_x italic_p ( italic_θ + italic_α start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) = 0 . (1)

Let the solution to the score equation for (θ,α)𝜃𝛼(\theta,\alpha)( italic_θ , italic_α ) be denoted by (θ^,α^)^𝜃^𝛼(\hat{\theta},\hat{\alpha})( over^ start_ARG italic_θ end_ARG , over^ start_ARG italic_α end_ARG ), and e^(Xi)=e(Xi;θ^,α^)^𝑒subscript𝑋𝑖𝑒subscript𝑋𝑖^𝜃^𝛼\hat{e}(X_{i})=e(X_{i};\hat{\theta},\hat{\alpha})over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; over^ start_ARG italic_θ end_ARG , over^ start_ARG italic_α end_ARG ). Then, we can determine the unique set of propensity scores for all subjects. In this paper, the propensity score estimated under the SITA assumption is called the MAR-based propensity score to avoid confusion; another type of propensity score is introduced in a later section, which is called the outcome-dependent propensity score. The IPW estimator for μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is defined by

μ^1=1ni=1nZiYie^(Xi).subscript^𝜇11𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖subscript𝑌𝑖^𝑒subscript𝑋𝑖\hat{\mu}_{1}=\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}Y_{i}}{\hat{e}(X_{i})}.over^ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG . (2)

Similarly, we can estimate μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT with

μ^0=1ni=1n(1Zi)Yi1e^(Xi),subscript^𝜇01𝑛superscriptsubscript𝑖1𝑛1subscript𝑍𝑖subscript𝑌𝑖1^𝑒subscript𝑋𝑖\hat{\mu}_{0}=\frac{1}{n}\sum_{i=1}^{n}\frac{(1-Z_{i})Y_{i}}{1-\hat{e}(X_{i})},over^ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 - over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG , (3)

and then the ATE ψ𝜓\psiitalic_ψ is estimated with

ψ^=μ^1μ^0.^𝜓subscript^𝜇1subscript^𝜇0\hat{\psi}=\hat{\mu}_{1}-\hat{\mu}_{0}.over^ start_ARG italic_ψ end_ARG = over^ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - over^ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT .

The aforementioned IPW estimator has an unstabilized form, which may suffer from extremely large weights when some propensity scores are very close to one or zero, and then can cause instability in the estimation. The stabilized IPW (SIPW) estimator introduces a stabilization term to the weights, which helps mitigate the impact of extreme weights. Specifically, the SIPW estimator for μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is defined by

μ^1,SIPW=(1ni=1nZie^(Xi))11ni=1nZiYie^(Xi).subscript^𝜇1𝑆𝐼𝑃𝑊superscript1𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖^𝑒subscript𝑋𝑖11𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖subscript𝑌𝑖^𝑒subscript𝑋𝑖\hat{\mu}_{1,SIPW}=\left(\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}}{\hat{e}(X_{i})}% \right)^{-1}\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}Y_{i}}{\hat{e}(X_{i})}.over^ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 , italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT = ( divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG . (4)

Similarly, we can estimate μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT with

μ^0,SIPW=(1ni=1n1Zi1e^(Xi))11ni=1n(1Zi)Yi1e^(Xi),subscript^𝜇0𝑆𝐼𝑃𝑊superscript1𝑛superscriptsubscript𝑖1𝑛1subscript𝑍𝑖1^𝑒subscript𝑋𝑖11𝑛superscriptsubscript𝑖1𝑛1subscript𝑍𝑖subscript𝑌𝑖1^𝑒subscript𝑋𝑖\hat{\mu}_{0,SIPW}=\left(\frac{1}{n}\sum_{i=1}^{n}\frac{1-Z_{i}}{1-\hat{e}(X_{% i})}\right)^{-1}\frac{1}{n}\sum_{i=1}^{n}\frac{(1-Z_{i})Y_{i}}{1-\hat{e}(X_{i}% )},over^ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 0 , italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT = ( divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 - over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 - over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG , (5)

and then the ATE ψ𝜓\psiitalic_ψ is estimated with

ψ^SIPW=μ^1,SIPWμ^0,SIPW.subscript^𝜓𝑆𝐼𝑃𝑊subscript^𝜇1𝑆𝐼𝑃𝑊subscript^𝜇0𝑆𝐼𝑃𝑊\hat{\psi}_{SIPW}=\hat{\mu}_{1,SIPW}-\hat{\mu}_{0,SIPW}.over^ start_ARG italic_ψ end_ARG start_POSTSUBSCRIPT italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT = over^ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 , italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT - over^ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 0 , italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT .

In this paper, we focus on the SIPW estimator.

If the SITA assumption holds and the model of the propensity score is correctly specified, the ATE is consistently estimated. However, the SITA assumption does not hold in the presence of unmeasured confounders.

2.2 Estimation with the outcome-dependent propensity score

In this section, suppose that the SITA assumption does not necessarily hold in the presence of unmeasured confounder U𝑈Uitalic_U. The estimation of the ATE using the method in Section 2.1 is no longer valid. To address the issue of unmeasured confounders, the outcome-dependent propensity score approach23, 24, 25 has been successfully introduced. We define the outcome-dependent propensity scores by o1(Xi,Yi(1))=P(Zi=1Xi,Yi(1))superscript𝑜1subscript𝑋𝑖superscriptsubscript𝑌𝑖1𝑃subscript𝑍𝑖conditional1subscript𝑋𝑖superscriptsubscript𝑌𝑖1o^{1}(X_{i},Y_{i}^{(1)})=P(Z_{i}=1\mid X_{i},Y_{i}^{(1)})italic_o start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ) = italic_P ( italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 ∣ italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ) and o0(Xi,Yi(0))=P(Zi=1Xi,Yi(0))superscript𝑜0subscript𝑋𝑖superscriptsubscript𝑌𝑖0𝑃subscript𝑍𝑖conditional1subscript𝑋𝑖superscriptsubscript𝑌𝑖0o^{0}(X_{i},Y_{i}^{(0)})=P(Z_{i}=1\mid X_{i},Y_{i}^{(0)})italic_o start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) = italic_P ( italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 ∣ italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) for subjects in the treated and control group, respectively.

One may consider the logistic regression models for o1(Xi,Yi(1))superscript𝑜1subscript𝑋𝑖superscriptsubscript𝑌𝑖1o^{1}(X_{i},Y_{i}^{(1)})italic_o start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ) and o0(Xi,Yi(0))superscript𝑜0subscript𝑋𝑖superscriptsubscript𝑌𝑖0o^{0}(X_{i},Y_{i}^{(0)})italic_o start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ). Let us consider the models logit(o1(Xi,Yi(1);θ1,α1,β1))=θ1+α1Xi+β1Yi(1)logitsuperscript𝑜1subscript𝑋𝑖superscriptsubscript𝑌𝑖1superscript𝜃1superscript𝛼1superscript𝛽1superscript𝜃1superscript𝛼limit-from1topsubscript𝑋𝑖superscript𝛽1superscriptsubscript𝑌𝑖1\mbox{logit}{(o^{1}(X_{i},Y_{i}^{(1)};\theta^{1},\alpha^{1},\beta^{1}))}=% \theta^{1}+\alpha^{1\top}X_{i}+\beta^{1}Y_{i}^{(1)}logit ( italic_o start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ; italic_θ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_β start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) ) = italic_θ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT + italic_α start_POSTSUPERSCRIPT 1 ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_β start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT and logit(o0(Xi,Yi(0);θ0,α0,β0))=θ0+α0Xi+β0Yi(0)logitsuperscript𝑜0subscript𝑋𝑖superscriptsubscript𝑌𝑖0superscript𝜃0superscript𝛼0superscript𝛽0superscript𝜃0superscript𝛼limit-from0topsubscript𝑋𝑖superscript𝛽0superscriptsubscript𝑌𝑖0\mbox{logit}{(o^{0}(X_{i},Y_{i}^{(0)};\theta^{0},\alpha^{0},\beta^{0}))}=% \theta^{0}+\alpha^{0\top}X_{i}+\beta^{0}Y_{i}^{(0)}logit ( italic_o start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ; italic_θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_β start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) ) = italic_θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT + italic_α start_POSTSUPERSCRIPT 0 ⊤ end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_β start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT. The score equation (1) does not work for estimation of the unknown parameters in these models, since Yi(z)superscriptsubscript𝑌𝑖𝑧Y_{i}^{(z)}italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_z ) end_POSTSUPERSCRIPT is observed only for subjects with Zi=zsubscript𝑍𝑖𝑧Z_{i}=zitalic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_z. The unknown parameters in the model of o1(Xi,Yi(1);θ1,α1,β1)superscript𝑜1subscript𝑋𝑖superscriptsubscript𝑌𝑖1superscript𝜃1superscript𝛼1superscript𝛽1o^{1}(X_{i},Y_{i}^{(1)};\theta^{1},\alpha^{1},\beta^{1})italic_o start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ; italic_θ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_β start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) can be estimated by solving the following estimating equation:

i=1ng(Xi)(1Zio1(Xi,Yi;θ1,α1,β1))=0,superscriptsubscript𝑖1𝑛𝑔subscript𝑋𝑖1subscript𝑍𝑖superscript𝑜1subscript𝑋𝑖subscript𝑌𝑖superscript𝜃1superscript𝛼1superscript𝛽10\sum_{i=1}^{n}g(X_{i})\left(1-\frac{Z_{i}}{o^{1}(X_{i},Y_{i};\theta^{1},\alpha% ^{1},\beta^{1})}\right)=0,∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_o start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_θ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_β start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) end_ARG ) = 0 , (6)

where g(X)𝑔𝑋g(X)italic_g ( italic_X ) is a vector of the same dimensions as (θ1,α1,β1)superscript𝜃1superscript𝛼1superscript𝛽1(\theta^{1},\alpha^{1},\beta^{1})( italic_θ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_β start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) and the solution to the estimating equation (6) is denoted by (θ^1,α^1,β^1)superscript^𝜃1superscript^𝛼1superscript^𝛽1(\hat{\theta}^{1},\hat{\alpha}^{1},\hat{\beta}^{1})( over^ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , over^ start_ARG italic_β end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ). Similarly, the unknown parameters in the model of o0(Xi,Yi(0);θ0,α0,β0)superscript𝑜0subscript𝑋𝑖superscriptsubscript𝑌𝑖0superscript𝜃0superscript𝛼0superscript𝛽0o^{0}(X_{i},Y_{i}^{(0)};\theta^{0},\alpha^{0},\beta^{0})italic_o start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ; italic_θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_β start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) can be estimated by solving the following estimating equation:

i=1ng(Xi)(11Zi1o0(Xi,Yi;θ0,α0,β0))=0.superscriptsubscript𝑖1𝑛𝑔subscript𝑋𝑖11subscript𝑍𝑖1superscript𝑜0subscript𝑋𝑖subscript𝑌𝑖superscript𝜃0superscript𝛼0superscript𝛽00\sum_{i=1}^{n}g(X_{i})\left(1-\frac{1-Z_{i}}{1-o^{0}(X_{i},Y_{i};\theta^{0},% \alpha^{0},\beta^{0})}\right)=0.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - divide start_ARG 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_o start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_β start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) end_ARG ) = 0 . (7)

The dimension of g(X)𝑔𝑋g(X)italic_g ( italic_X ) should be equal to that of (θ0,α0,β0)superscript𝜃0superscript𝛼0superscript𝛽0(\theta^{0},\alpha^{0},\beta^{0})( italic_θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_α start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_β start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) to obtain a solution. The solution to the estimating equation (7) is denoted by (θ^0,α^0,β^0)superscript^𝜃0superscript^𝛼0superscript^𝛽0(\hat{\theta}^{0},\hat{\alpha}^{0},\hat{\beta}^{0})( over^ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , over^ start_ARG italic_β end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ). Denote o^1(Xi,Yi(1))=o1(Xi,Yi;θ^1,α^1,β^1)superscript^𝑜1subscript𝑋𝑖subscriptsuperscript𝑌1𝑖superscript𝑜1subscript𝑋𝑖subscript𝑌𝑖superscript^𝜃1superscript^𝛼1superscript^𝛽1\hat{o}^{1}(X_{i},Y^{(1)}_{i})=o^{1}(X_{i},Y_{i};\hat{\theta}^{1},\hat{\alpha}% ^{1},\hat{\beta}^{1})over^ start_ARG italic_o end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_o start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; over^ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , over^ start_ARG italic_β end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) and o^0(Xi,Yi(0))=o0(Xi,Yi;θ^0,α^0,β^0)superscript^𝑜0subscript𝑋𝑖subscriptsuperscript𝑌0𝑖superscript𝑜0subscript𝑋𝑖subscript𝑌𝑖superscript^𝜃0superscript^𝛼0superscript^𝛽0\hat{o}^{0}(X_{i},Y^{(0)}_{i})=o^{0}(X_{i},Y_{i};\hat{\theta}^{0},\hat{\alpha}% ^{0},\hat{\beta}^{0})over^ start_ARG italic_o end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_o start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; over^ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , over^ start_ARG italic_β end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ), respectively. We can then estimate μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT under the MNAR with

μ^1,SIPWMNAR=(1ni=1nZio^1(Xi,Yi(1)))11ni=1nZiYio^1(Xi,Yi(1)).subscriptsuperscript^𝜇𝑀𝑁𝐴𝑅1𝑆𝐼𝑃𝑊superscript1𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖superscript^𝑜1subscript𝑋𝑖subscriptsuperscript𝑌1𝑖11𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖subscript𝑌𝑖superscript^𝑜1subscript𝑋𝑖subscriptsuperscript𝑌1𝑖\hat{\mu}^{MNAR}_{1,SIPW}=\left(\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}}{\hat{o}^% {1}(X_{i},Y^{(1)}_{i})}\right)^{-1}\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}Y_{i}}{% \hat{o}^{1}(X_{i},Y^{(1)}_{i})}.over^ start_ARG italic_μ end_ARG start_POSTSUPERSCRIPT italic_M italic_N italic_A italic_R end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 , italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT = ( divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_o end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_o end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG .

Similarly, we can estimate μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT under the MNAR with

μ^0,SIPWMNAR=(1ni=1n1Zi1o^0(Xi,Yi(0)))11ni=1n(1Zi)Yi1o^(Xi,Yi(0)),subscriptsuperscript^𝜇𝑀𝑁𝐴𝑅0𝑆𝐼𝑃𝑊superscript1𝑛superscriptsubscript𝑖1𝑛1subscript𝑍𝑖1superscript^𝑜0subscript𝑋𝑖subscriptsuperscript𝑌0𝑖11𝑛superscriptsubscript𝑖1𝑛1subscript𝑍𝑖subscript𝑌𝑖1^𝑜subscript𝑋𝑖subscriptsuperscript𝑌0𝑖\hat{\mu}^{MNAR}_{0,SIPW}=\left(\frac{1}{n}\sum_{i=1}^{n}\frac{1-Z_{i}}{1-\hat% {o}^{0}(X_{i},Y^{(0)}_{i})}\right)^{-1}\frac{1}{n}\sum_{i=1}^{n}\frac{(1-Z_{i}% )Y_{i}}{1-\hat{o}(X_{i},Y^{(0)}_{i})},over^ start_ARG italic_μ end_ARG start_POSTSUPERSCRIPT italic_M italic_N italic_A italic_R end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 , italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT = ( divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 - over^ start_ARG italic_o end_ARG start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 - over^ start_ARG italic_o end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ,

and then the ATE is estimated with

ψ^SIPWMNAR=μ^1,SIPWMNARμ^0,SIPWMNAR.subscriptsuperscript^𝜓𝑀𝑁𝐴𝑅𝑆𝐼𝑃𝑊subscriptsuperscript^𝜇𝑀𝑁𝐴𝑅1𝑆𝐼𝑃𝑊subscriptsuperscript^𝜇𝑀𝑁𝐴𝑅0𝑆𝐼𝑃𝑊\hat{\psi}^{MNAR}_{SIPW}=\hat{\mu}^{MNAR}_{1,SIPW}-\hat{\mu}^{MNAR}_{0,SIPW}.over^ start_ARG italic_ψ end_ARG start_POSTSUPERSCRIPT italic_M italic_N italic_A italic_R end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT = over^ start_ARG italic_μ end_ARG start_POSTSUPERSCRIPT italic_M italic_N italic_A italic_R end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 , italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT - over^ start_ARG italic_μ end_ARG start_POSTSUPERSCRIPT italic_M italic_N italic_A italic_R end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 , italic_S italic_I italic_P italic_W end_POSTSUBSCRIPT .

The SIPW estimator with the outcome-dependent propensity score can consistently estimate the ATE without the SITA assumption as long as the parametric models for the outcome-dependent propensity score are correctly specified. However, estimations with (6) and (7) often encounter an unidentifiability issue, wherein the model coefficients obtained through solving the estimating equations may not be uniquely determined. Miao et al.9 pointed out that even if the model for the propensity score has a known parametric form, the model is not identifiable without specifying a parametric outcome distribution. A unique solution to the estimating equations is only achieved when both the outcome model and the propensity score model are appropriately specified. Specifically, without additional restrictions or assumptions, sorely solving the estimating equations (6) is not sufficient to determine the coefficients (θ^1,α^1,β^1)superscript^𝜃1superscript^𝛼1superscript^𝛽1(\hat{\theta}^{1},\hat{\alpha}^{1},\hat{\beta}^{1})( over^ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , over^ start_ARG italic_α end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , over^ start_ARG italic_β end_ARG start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) uniquely. Therefore, the outcome-dependent propensity score cannot solve the issue of the unmeasured confounder completely.

3 Existing sensitivity analysis methods

In this section, we will briefly review some existing sensitivity analysis methods for the IPW estimator.

3.1 Modeling the mean difference of the potential outcomes

Along with the lines of the work by Robins et al.,26, 27 Brumback et al. 28 proposed to quantify the impact of the unmeasured confounders by modeling the mean between-group difference of the potential outcomes, conditional on all observed covariates. The sensitivity function is defined by c(z,X)=E[Y(z)Z=1,X]E[Y(z)Z=0,X]𝑐𝑧𝑋𝐸delimited-[]conditionalsuperscript𝑌𝑧𝑍1𝑋𝐸delimited-[]conditionalsuperscript𝑌𝑧𝑍0𝑋c(z,X)=E[Y^{(z)}\mid Z=1,X]-E[Y^{(z)}\mid Z=0,X]italic_c ( italic_z , italic_X ) = italic_E [ italic_Y start_POSTSUPERSCRIPT ( italic_z ) end_POSTSUPERSCRIPT ∣ italic_Z = 1 , italic_X ] - italic_E [ italic_Y start_POSTSUPERSCRIPT ( italic_z ) end_POSTSUPERSCRIPT ∣ italic_Z = 0 , italic_X ]. If the SITA assumption holds, c(z,X)𝑐𝑧𝑋c(z,X)italic_c ( italic_z , italic_X ) equals zero. Thus, the sensitivity function can describe the magnitude of the departure from SITA assumption or the impact of the unmeasured confounders. Once we specify the sensitivity function c(z,X)𝑐𝑧𝑋c(z,X)italic_c ( italic_z , italic_X ), one can predict the mean function of the counterfactual variables conditional on X𝑋Xitalic_X and then estimate the ATE without the SITA assumption. Li et al. 15 criticized a technical difficulty in defining the sensitivity function when covariates X𝑋Xitalic_X contain multiple dimensions. Of note, in practical sensitivity analysis, if X𝑋Xitalic_X is multi-dimensional, not only the functional form but also the specific coefficients for each covariate are required to be specified. Such specifications were criticized to be unlikely to accurately reflect the relationship between the departure from SITA assumption and the potential outcomes. Li et al. 15 proposed a refinement by defining the sensitivity function as a function of the MAR-based propensity score: c(z,e(X))=E[Y(z)Z=1,e(X)]E[Y(z)Z=0,e(X)]𝑐𝑧𝑒𝑋𝐸delimited-[]conditionalsuperscript𝑌𝑧𝑍1𝑒𝑋𝐸delimited-[]conditionalsuperscript𝑌𝑧𝑍0𝑒𝑋c(z,e(X))=E[Y^{(z)}\mid Z=1,e(X)]-E[Y^{(z)}\mid Z=0,e(X)]italic_c ( italic_z , italic_e ( italic_X ) ) = italic_E [ italic_Y start_POSTSUPERSCRIPT ( italic_z ) end_POSTSUPERSCRIPT ∣ italic_Z = 1 , italic_e ( italic_X ) ] - italic_E [ italic_Y start_POSTSUPERSCRIPT ( italic_z ) end_POSTSUPERSCRIPT ∣ italic_Z = 0 , italic_e ( italic_X ) ]. The MAR-based propensity score is a one-dimension summary of observed covariates, and this refinement made the specification of the sensitivity function much simpler.

However, in reality, even with the simplification by Li et al.15, it is not an easy task to define a plausible range of sensitivity functions. Furthermore, their method still relies on the estimation of the MAR-based propensity score. Misspecification of the parametric model for the MAR-based propensity score may result in difficulty in interpreting the results of the sensitivity analysis.

3.2 The marginal sensitivity model

Tan 19 proposed the marginal sensitivity model, which describes a relaxation of the SITA assumption. The model assumes a single sensitivity parameter, which permits the presence of the unmeasured confounders U𝑈Uitalic_U, but restricts the extent of selection bias that can be attributed to these confounders. One can specify a parameter λ𝜆\lambdaitalic_λ, and then the following inequality is supposed to hold:

1/λe(Xi,Ui)/(1e(Xi,Ui))e^(Xi)/(1e^(Xi))λ,1𝜆𝑒subscript𝑋𝑖subscript𝑈𝑖1𝑒subscript𝑋𝑖subscript𝑈𝑖^𝑒subscript𝑋𝑖1^𝑒subscript𝑋𝑖𝜆1/\lambda\leq\frac{e(X_{i},U_{i})/(1-e(X_{i},U_{i}))}{\hat{e}(X_{i})/(1-\hat{e% }(X_{i}))}\leq\lambda,1 / italic_λ ≤ divide start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) / ( 1 - italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_ARG start_ARG over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) / ( 1 - over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_ARG ≤ italic_λ ,

where e(Xi,Ui)𝑒subscript𝑋𝑖subscript𝑈𝑖e(X_{i},U_{i})italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) refers to the true propensity score measuring all covariates, and e^(Xi)^𝑒subscript𝑋𝑖\hat{e}(X_{i})over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) refers to the estimated MAR-based propensity score. The single parameter λ𝜆\lambdaitalic_λ, that is, the odds ratio (OR) between true propensity score and estimated propensity score, can control degree of unconfoundedness. When λ=1𝜆1\lambda=1italic_λ = 1, the inclusion of additional confounders has no effect on the treatment odds. This implies that the allocation of the treatment is not influenced by confounding factors. That is, the SITA assumption holds. Increasing λ𝜆\lambdaitalic_λ represents the allowance for stronger extent to which the SITA assumption is violated. Tan 19 proposed a sensitivity analysis method to assess how the estimates based on the nonparametric likelihood change under the violation of the SITA assumption.

By introducing the marginal sensitivity model, the sensitivity analysis for unmeasured confounders can be applied to the IPW estimator under the MNAR. If U𝑈Uitalic_U was observed, one can estimate μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT with

(1ni=1nZie(Xi,Ui))11ni=1nZiYie(Xi,Ui).superscript1𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖𝑒subscript𝑋𝑖subscript𝑈𝑖11𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖subscript𝑌𝑖𝑒subscript𝑋𝑖subscript𝑈𝑖\left(\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}}{e(X_{i},U_{i})}\right)^{-1}\frac{1% }{n}\sum_{i=1}^{n}\frac{Z_{i}Y_{i}}{e(X_{i},U_{i})}.( divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG . (8)

In practice, U𝑈{U}italic_U is unobserved and μ^1subscript^𝜇1\hat{\mu}_{1}over^ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in equation (8) actually makes no sense. However, under the marginal sensitivity model, λ𝜆\lambdaitalic_λ can link the unobserved true propensity score and the estimated MAR-based propensity score, so that it is possible to evaluate bounds of (8) under some constraints. That is

maxodermin(1ni=1nZie(Xi,Ui))11ni=1nZiYie(Xi,Ui)subject to1/λ1e(Xi,Ui)/(1e(Xi,Ui))e^(Xi)/(1e^(Xi))λ1,\begin{split}\max\mbox{or}\min&\left(\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}}{e(X% _{i},U_{i})}\right)^{-1}\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}Y_{i}}{e(X_{i},U_{% i})}\\ \mbox{subject to}\quad&1/\lambda^{1}\leq\frac{e(X_{i},U_{i})/(1-e(X_{i},U_{i})% )}{\hat{e}(X_{i})/(1-\hat{e}(X_{i}))}\leq\lambda^{1},\end{split}start_ROW start_CELL roman_max or roman_min end_CELL start_CELL ( divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG end_CELL end_ROW start_ROW start_CELL subject to end_CELL start_CELL 1 / italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ≤ divide start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) / ( 1 - italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_ARG start_ARG over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) / ( 1 - over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_ARG ≤ italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , end_CELL end_ROW (9)

where λ1superscript𝜆1\lambda^{1}italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT is the pre-specified constant, which describes the upper and lower bounds of the discrepancy of the true propensity score from the estimated MAR-based propensity score for the estimation of μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. As long as the true propensity score for all the subjects satisfies the constraint, the true μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT should be bounded by the minimum and maximum of (9) asymptotically. It is possible to have an interval for μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in a similar way. This method under the marginal sensitivity model was proposed firstly by Zhao.18 In this method, the sensitivity parameter λ𝜆\lambdaitalic_λ quantifies the extent to which the SITA assumption is violated. However, it still suffers from defining a plausible range for the sensitivity parameter and reliance on correct specification of the MAR-based propensity score model.

It was criticized that the interval obtained by (9) may not be tight and the interval was asymptotically conservative.20 Dorn and Guo 20 proposed the quantile balancing method, a refinement based on the marginal sensitivity model. Let F(yx,z)=P(YyX=x,Z=z)F(y\mid x,z)=P(Y\leq y\mid X=x,Z=z)italic_F ( italic_y ∣ italic_x , italic_z ) = italic_P ( italic_Y ≤ italic_y ∣ italic_X = italic_x , italic_Z = italic_z ) and the quantile function is defined by Qt(x,z)=inf{q:F(qx,z)t}subscript𝑄𝑡𝑥𝑧infconditional-set𝑞𝐹conditional𝑞𝑥𝑧𝑡Q_{t}(x,z)=\mbox{inf}\{q:F(q\mid x,z)\geq t\}italic_Q start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , italic_z ) = inf { italic_q : italic_F ( italic_q ∣ italic_x , italic_z ) ≥ italic_t }. For bounding μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, the quantile balancing method solves the following optimization problem:

maxoderminoder\displaystyle\max\mbox{or}\minroman_max or roman_min (1ni=1nZie(Xi,Ui))11ni=1nZiYie(Xi,Ui)superscript1𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖𝑒subscript𝑋𝑖subscript𝑈𝑖11𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖subscript𝑌𝑖𝑒subscript𝑋𝑖subscript𝑈𝑖\displaystyle\left(\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}}{e(X_{i},U_{i})}\right% )^{-1}\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}Y_{i}}{e(X_{i},U_{i})}( divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG (10)
subject to i=1n(1Q^τ(Xi,1))(Zie(Xi,Ui)Zie^(Xi))=0superscriptsubscript𝑖1𝑛1missing-subexpressionsubscript^𝑄𝜏subscript𝑋𝑖1missing-subexpressionsubscript𝑍𝑖𝑒subscript𝑋𝑖subscript𝑈𝑖subscript𝑍𝑖^𝑒subscript𝑋𝑖0\displaystyle\sum_{i=1}^{n}\left(\begin{array}[]{cc}1\\ \hat{Q}_{\tau}(X_{i},1)\end{array}\right)\left(\frac{Z_{i}}{e(X_{i},U_{i})}-% \frac{Z_{i}}{\hat{e}(X_{i})}\right)=0∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , 1 ) end_CELL start_CELL end_CELL end_ROW end_ARRAY ) ( divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG - divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) = 0 (13)
1/λ1e(Xi,Ui)/(1e(Xi,Ui))e^(Xi)/(1e^(Xi))λ1,1superscript𝜆1𝑒subscript𝑋𝑖subscript𝑈𝑖1𝑒subscript𝑋𝑖subscript𝑈𝑖^𝑒subscript𝑋𝑖1^𝑒subscript𝑋𝑖superscript𝜆1\displaystyle 1/\lambda^{1}\leq\frac{e(X_{i},U_{i})/(1-e(X_{i},U_{i}))}{\hat{e% }(X_{i})/(1-\hat{e}(X_{i}))}\leq\lambda^{1},1 / italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ≤ divide start_ARG italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) / ( 1 - italic_e ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_ARG start_ARG over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) / ( 1 - over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_ARG ≤ italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , (14)

where τ=λ11+λ1𝜏superscript𝜆11superscript𝜆1\tau=\frac{\lambda^{1}}{1+\lambda^{1}}italic_τ = divide start_ARG italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_ARG start_ARG 1 + italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_ARG and Q^τ(Xi,1)subscript^𝑄𝜏subscript𝑋𝑖1\hat{Q}_{\tau}(X_{i},1)over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , 1 ) is estimated with some quantile regression models.20 Bounding μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and the ATE can be achieved in a similar way. The quantile balancing method refined Zhao’s sensitivity analysis method 18 by adding the quantile function to balance the treatment assignment Z𝑍Zitalic_Z over the true propensity score at population level. This additional constraint based on the estimated quantile function ensured asymptotic optimality of the interval obtained by solving (10). Although it solves asymptotic conservativeness in Zhao’s method,18 it still suffers from the misspecification of the estimated MAR-based propensity score. Moreover, the quantile function also requires specifying some parametric models or machine learning-related methods.

4 The Proposed sensitivity analysis method

We begin with the bound for μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Let e1(Xi,Ui)superscript𝑒1subscript𝑋𝑖subscript𝑈𝑖e^{1}(X_{i},U_{i})italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) denote the true propensity score for subjects in the treated group. Let us consider to construct the upper bound of μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT by solving the following optimization problem:

μ¯1+=max\displaystyle\bar{\mu}_{1}^{+}=\quad\max\quadover¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT = roman_max 1ni=1nZiYie1(Xi,Ui)1𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖subscript𝑌𝑖superscript𝑒1subscript𝑋𝑖subscript𝑈𝑖\displaystyle\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}Y_{i}}{e^{1}(X_{i},U_{i})}divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG (15)
subject to δe1(Xi,Ui)1δ𝛿superscript𝑒1subscript𝑋𝑖subscript𝑈𝑖1𝛿\displaystyle\delta\leq e^{1}(X_{i},U_{i})\leq 1-\deltaitalic_δ ≤ italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ 1 - italic_δ (16)
i=1ng(Xi)(1Zie1(Xi,Ui))=0.superscriptsubscript𝑖1𝑛𝑔subscript𝑋𝑖1subscript𝑍𝑖superscript𝑒1subscript𝑋𝑖subscript𝑈𝑖0\displaystyle\sum_{i=1}^{n}g(X_{i})\left(1-\frac{Z_{i}}{e^{1}(X_{i},U_{i})}% \right)=0.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) = 0 . (17)

In a similar way, to obtain the lower bound of μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, let us consider the following problem:

μ¯1=min\displaystyle\bar{\mu}_{1}^{-}=\quad\min\quadover¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT = roman_min 1ni=1nZiYie1(Xi,Ui)1𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖subscript𝑌𝑖superscript𝑒1subscript𝑋𝑖subscript𝑈𝑖\displaystyle\frac{1}{n}\sum_{i=1}^{n}\frac{Z_{i}Y_{i}}{e^{1}(X_{i},U_{i})}divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG (18)
subject to δe1(Xi,Ui)1δ𝛿superscript𝑒1subscript𝑋𝑖subscript𝑈𝑖1𝛿\displaystyle\delta\leq e^{1}(X_{i},U_{i})\leq 1-\deltaitalic_δ ≤ italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ 1 - italic_δ (19)
i=1ng(Xi)(1Zie1(Xi,Ui))=0.superscriptsubscript𝑖1𝑛𝑔subscript𝑋𝑖1subscript𝑍𝑖superscript𝑒1subscript𝑋𝑖subscript𝑈𝑖0\displaystyle\sum_{i=1}^{n}g(X_{i})\left(1-\frac{Z_{i}}{e^{1}(X_{i},U_{i})}% \right)=0.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) = 0 . (20)

The constraints (16) and (19) come from the positivity assumption (Assumption 2.1), which is a fundamental assumption in causal inference. We regard δ𝛿\deltaitalic_δ in (16) and (18) as a sensitivity parameter. The constraints (17) and (20) come from the estimating equation for the outcome-dependent propensity score (6). As mentioned, the estimating equation (6) cannot necessarily identify the true propensity score model uniquely from a parametric model. However, according to the law of large number, it holds that

i=1ng(Xi)(1Zie1(Xi,Ui))𝑝E[g(X)(1Ze1(X,U))]=0.𝑝superscriptsubscript𝑖1𝑛𝑔subscript𝑋𝑖1subscript𝑍𝑖superscript𝑒1subscript𝑋𝑖subscript𝑈𝑖𝐸delimited-[]𝑔𝑋1𝑍superscript𝑒1𝑋𝑈0\sum_{i=1}^{n}g(X_{i})\left(1-\frac{Z_{i}}{e^{1}(X_{i},U_{i})}\right)% \xrightarrow{p}E\left[g(X)\left(1-\frac{Z}{e^{1}(X,U)}\right)\right]=0.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - divide start_ARG italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) start_ARROW overitalic_p → end_ARROW italic_E [ italic_g ( italic_X ) ( 1 - divide start_ARG italic_Z end_ARG start_ARG italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X , italic_U ) end_ARG ) ] = 0 . (21)

Then, the true propensity scores should satisfy the constraints (17) and (20) asymptotically, and therefore, μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT should be included in the interval [μ1,μ1+]superscriptsubscript𝜇1superscriptsubscript𝜇1[\mu_{1}^{-},\mu_{1}^{+}][ italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT , italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ] asymptotically. The bound for μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is the interval [μ¯1+,μ¯1]superscriptsubscript¯𝜇1superscriptsubscript¯𝜇1[\bar{\mu}_{1}^{+},\bar{\mu}_{1}^{-}][ over¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT , over¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ], which contains the true μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.

Let us consider the inverse of the true propensity score, denoted by wi1=(e1(Xi,Ui))1superscriptsubscript𝑤𝑖1superscriptsuperscript𝑒1subscript𝑋𝑖subscript𝑈𝑖1w_{i}^{1}=(e^{1}(X_{i},U_{i}))^{-1}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT = ( italic_e start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, as the decision variable. Then, optimization problems (15) and (18) become a linear programming problem with linear constraints:

minodermaxoder\displaystyle\quad\min\mbox{or}\max\quadroman_min or roman_max 1ni=1nZiYiwi11𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖subscript𝑌𝑖superscriptsubscript𝑤𝑖1\displaystyle\frac{1}{n}\sum_{i=1}^{n}Z_{i}Y_{i}w_{i}^{1}divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT (22)
subject to 11δwi11δ11𝛿superscriptsubscript𝑤𝑖11𝛿\displaystyle\frac{1}{1-\delta}\leq w_{i}^{1}\leq\frac{1}{\delta}divide start_ARG 1 end_ARG start_ARG 1 - italic_δ end_ARG ≤ italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG (23)
i=1ng(Xi)(1Ziwi1)=0.superscriptsubscript𝑖1𝑛𝑔subscript𝑋𝑖1subscript𝑍𝑖superscriptsubscript𝑤𝑖10\displaystyle\sum_{i=1}^{n}g(X_{i})(1-Z_{i}w_{i}^{1})=0.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) = 0 . (24)

Compared to the quantile balancing method, which is nonlinear optimization and requires estimation of the quantile functions, our proposal can be solved time-efficiently with the interior-point method or the simplex algorithm for the linear programming and then tractable with standard software for mathematical programming.

The bound for μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT can be constructed in a similar way as follows. Let e0(Xi,Ui)superscript𝑒0subscript𝑋𝑖subscript𝑈𝑖e^{0}(X_{i},U_{i})italic_e start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) denote the true propensity score for subjects in the control group and similarly consider the weight wi0=(1e0(Xi,Ui))1superscriptsubscript𝑤𝑖0superscript1superscript𝑒0subscript𝑋𝑖subscript𝑈𝑖1w_{i}^{0}=(1-e^{0}(X_{i},U_{i}))^{-1}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = ( 1 - italic_e start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT as the decision variable. Then the interval [μ¯0,μ¯0+]superscriptsubscript¯𝜇0superscriptsubscript¯𝜇0[\bar{\mu}_{0}^{-},\bar{\mu}_{0}^{+}][ over¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT , over¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ] can be obtained by solving the following linear programming problem:

minodermaxoder\displaystyle\quad\min\mbox{or}\max\quadroman_min or roman_max 1ni=1n(1Zi)Yiwi01𝑛superscriptsubscript𝑖1𝑛1subscript𝑍𝑖subscript𝑌𝑖superscriptsubscript𝑤𝑖0\displaystyle\frac{1}{n}\sum_{i=1}^{n}(1-Z_{i})Y_{i}w_{i}^{0}divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT (25)
subject to 11δwi01δ11𝛿superscriptsubscript𝑤𝑖01𝛿\displaystyle\frac{1}{1-\delta}\leq w_{i}^{0}\leq\frac{1}{\delta}divide start_ARG 1 end_ARG start_ARG 1 - italic_δ end_ARG ≤ italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG (26)
i=1ng(Xi)(1(1Zi)wi0)=0.superscriptsubscript𝑖1𝑛𝑔subscript𝑋𝑖11subscript𝑍𝑖superscriptsubscript𝑤𝑖00\displaystyle\sum_{i=1}^{n}g(X_{i})(1-(1-Z_{i})w_{i}^{0})=0.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) = 0 . (27)

We obtain bounds for ψ𝜓\psiitalic_ψ by [μ¯1μ¯0+,μ¯1+μ¯0]superscriptsubscript¯𝜇1superscriptsubscript¯𝜇0superscriptsubscript¯𝜇1superscriptsubscript¯𝜇0[\bar{\mu}_{1}^{-}-\bar{\mu}_{0}^{+},\bar{\mu}_{1}^{+}-\bar{\mu}_{0}^{-}][ over¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT - over¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT , over¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT - over¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ].

Generally, in the estimation of propensity score, the dimension of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) should be equal to the number of unknown parameters in the parametric model for the propensity score. In the proposed sensitivity analysis method, one can impose more constraints by increasing the dimension of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), thereby yielding a narrower bound obtained by the linear programming problems (22) and (25). g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) can be any function of Xisubscript𝑋𝑖X_{i}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Suppose that there are K𝐾Kitalic_K covariates: X=(Xi,1,Xi,2,,Xi,K)superscript𝑋topsubscript𝑋𝑖1subscript𝑋𝑖2subscript𝑋𝑖𝐾X^{\top}=(X_{i,1},X_{i,2},\dots,X_{i,K})italic_X start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT = ( italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_i , italic_K end_POSTSUBSCRIPT ). Then, g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) can be like

g(Xi)=(1Xi,1Xi,K)oderg(Xi)=(1Xi,1Xi,KXi,12Xi,K2)oderg(X)=(1Xi,1Xi,KXi,12Xi,K2).formulae-sequence𝑔subscript𝑋𝑖1missing-subexpressionmissing-subexpressionmissing-subexpressionsubscript𝑋𝑖1missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsubscript𝑋𝑖𝐾missing-subexpressionmissing-subexpressionmissing-subexpressionoderformulae-sequence𝑔subscript𝑋𝑖1missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsubscript𝑋𝑖1missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsubscript𝑋𝑖𝐾missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsuperscriptsubscript𝑋𝑖12missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsuperscriptsubscript𝑋𝑖𝐾2missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionoder𝑔𝑋1missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsubscript𝑋𝑖1missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsubscript𝑋𝑖𝐾missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsuperscriptsubscript𝑋𝑖12missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsuperscriptsubscript𝑋𝑖𝐾2missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressiong(X_{i})=\left(\begin{array}[]{cccc}1\\ X_{i,1}\\ \vdots\\ X_{i,K}\end{array}\right)\quad\mbox{or}\quad g(X_{i})=\left(\begin{array}[]{% ccccccc}1\\ X_{i,1}\\ \vdots\\ X_{i,K}\\ X_{i,1}^{2}\\ \vdots\\ X_{i,K}^{2}\end{array}\right)\quad\mbox{or}\quad g(X)=\left(\begin{array}[]{% cccccccc}1\\ X_{i,1}\\ \vdots\\ X_{i,K}\\ X_{i,1}^{2}\\ \vdots\\ X_{i,K}^{2}\\ \vdots\end{array}\right).italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = ( start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , italic_K end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ) or italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = ( start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , italic_K end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , italic_K end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ) or italic_g ( italic_X ) = ( start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , italic_K end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , italic_K end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ) . (28)

As long as the resulting constraints give us feasible solutions to the optimization problems, the proposed method is expected to narrow the bound by simply increasing the dimension of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), since greater flexibility on the choice of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is allowed.

The IPW estimators (2) and (3) do not satisfy the population boundedness property: the IPW estimator can be beyond the range of the outcome.29, 30 On the other hand, the SIPW estimators (4) and (5) satisfy it. The objective functions (15) and (18) have the form of the IPW estimator. If we set the first element of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) to be 1, as seen in (28), the IPW estimator agrees with the SIPW estimator. Consequently, it is sufficient to consider using a more computationally tractable, unstabilized form as the objective function and suggested to consider 1 as the first element of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). Dorn and Guo 20 also considered the condition (21), but criticized that g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) should involve infinitely many moment conditions. Coupled with the constraint (14), they showed that the infinitely many constraints can be replaced with a single constraint of the quantile balancing (13). This simplification with the quantile balancing is realized with the OR-based constraint (14). In practical sensitivity analysis of observational studies, the bounds for the ATE obtained by optimizing (15) and (18) are generally compared with a specific threshold, such as zero, to ensure the robustness of the results. Therefore, there is no need to introduce an infinite number of constraints, as it is sufficient to increase the dimension of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) to ensure the robustness of the causal inference in an observational study. In addition, the quantile function must be estimated and then be subject to assumptions in modeling and estimation, although Dorn and Guo 20 tried to minimize the risk of misspecification by introducing flexible models. The authors provides several machine learning-related estimation methods, which might yield notably different bounds from each other in their simulation study.20 One advantage of the proposed method is that it does not rely on messy estimation in the quantile regression. Simply by increasing dimension of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), we can try to make the bound narrower. Furthermore, it is more crucial that our method can provide bounds without relying of the condition (14), λ𝜆\lambdaitalic_λ in which is hard to specify.

On the other hand, the bounds may be wide without (14) and may not give any meaningful information. If this is the case, the constraint (14) can be incorporated into the optimization problems (22) and (25) as follows:

minodermaxoder\displaystyle\quad\min\mbox{or}\max\quadroman_min or roman_max 1ni=1nZiYiwi11𝑛superscriptsubscript𝑖1𝑛subscript𝑍𝑖subscript𝑌𝑖superscriptsubscript𝑤𝑖1\displaystyle\frac{1}{n}\sum_{i=1}^{n}Z_{i}Y_{i}w_{i}^{1}divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT (29)
subject to 11δwi11δ11𝛿superscriptsubscript𝑤𝑖11𝛿\displaystyle\frac{1}{1-\delta}\leq w_{i}^{1}\leq\frac{1}{\delta}divide start_ARG 1 end_ARG start_ARG 1 - italic_δ end_ARG ≤ italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG
i=1ng(Xi)(1Ziwi1)=0superscriptsubscript𝑖1𝑛𝑔subscript𝑋𝑖1subscript𝑍𝑖superscriptsubscript𝑤𝑖10\displaystyle\sum_{i=1}^{n}g(X_{i})(1-Z_{i}w_{i}^{1})=0∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) = 0
1+(λ11)e^(Xi)λ1e^(Xi)wi11+e^(Xi)(1/λ11)e^(Xi)(1/λ1).1superscript𝜆11^𝑒subscript𝑋𝑖superscript𝜆1^𝑒subscript𝑋𝑖superscriptsubscript𝑤𝑖11^𝑒subscript𝑋𝑖1superscript𝜆11^𝑒subscript𝑋𝑖1superscript𝜆1\displaystyle\frac{1+(\lambda^{1}-1)\hat{e}(X_{i})}{\lambda^{1}\hat{e}(X_{i})}% \leq w_{i}^{1}\leq\frac{1+\hat{e}(X_{i})(1/\lambda^{1}-1)}{\hat{e}(X_{i})(1/% \lambda^{1})}.divide start_ARG 1 + ( italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT - 1 ) over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ≤ italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ≤ divide start_ARG 1 + over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 / italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT - 1 ) end_ARG start_ARG over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 / italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) end_ARG . (30)

The additional constraint (30) is from the marginal sensitivity model (9), in which e^(Xi)^𝑒subscript𝑋𝑖\hat{e}(X_{i})over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) refers to estimated MAR-based propensity score depending merely on measured covariates, and λ1superscript𝜆1\lambda^{1}italic_λ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT refers to the OR between true propensity score and the estimated MAR-based propensity score. Note that, after introducing the third constraint (30), the optimization problem remains a linear programming problem. With g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) fixed, the addition of the constraint (30) can further narrow the bound obtained by solving the linear programming (29). The bound for μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT can be obtained in a similar way:

minodermax1ni=1n(1Zi)Yiwi0subject to11δwi01δi=1ng(Xi)(1(1Zi)wi0)=0e^(Xi)λ0(1e^(Xi))+1wi0λ0e^(Xi)1e^(Xi)+1.oder1𝑛superscriptsubscript𝑖1𝑛1subscript𝑍𝑖subscript𝑌𝑖superscriptsubscript𝑤𝑖0subject to11𝛿superscriptsubscript𝑤𝑖01𝛿superscriptsubscript𝑖1𝑛𝑔subscript𝑋𝑖11subscript𝑍𝑖superscriptsubscript𝑤𝑖00^𝑒subscript𝑋𝑖superscript𝜆01^𝑒subscript𝑋𝑖1superscriptsubscript𝑤𝑖0superscript𝜆0^𝑒subscript𝑋𝑖1^𝑒subscript𝑋𝑖1\begin{split}\quad\min\mbox{or}\max\quad&\frac{1}{n}\sum_{i=1}^{n}(1-Z_{i})Y_{% i}w_{i}^{0}\\ \mbox{subject to}\quad&\frac{1}{1-\delta}\leq w_{i}^{0}\leq\frac{1}{\delta}\\ &\sum_{i=1}^{n}g(X_{i})(1-(1-Z_{i})w_{i}^{0})=0\\ &\frac{\hat{e}(X_{i})}{\lambda^{0}(1-\hat{e}(X_{i}))}+1\leq w_{i}^{0}\leq\frac% {\lambda^{0}\hat{e}(X_{i})}{1-\hat{e}(X_{i})}+1.\end{split}start_ROW start_CELL roman_min or roman_max end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL subject to end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 1 - italic_δ end_ARG ≤ italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - ( 1 - italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) = 0 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL divide start_ARG over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG italic_λ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( 1 - over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_ARG + 1 ≤ italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ≤ divide start_ARG italic_λ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG 1 - over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG + 1 . end_CELL end_ROW (31)

As done by Dorn and Guo,20 estimation error for the bounds can be accounted by using the bootstrap confidence intervals of the lower and upper bounds. One may hope to make the bounds tighter by introducing g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) of the higher dimension. A concern is putting more variables may lead unreliable bounds of less stability. The bootstrap samples would also be useful to evaluate how stable the resulting bounds are: if the number of the bootstrap samples of feasible solutions of the linear programming is small, the resulting bounds should be carefully interpreted.

5 Simulation Study

In this section, we investigate the performance of the proposed method and compare it with Dorn and Guo’s method20 based on the marginal sensitivity model and quantile balancing over several simulated datasets. The simulation settings followed Morikawa and Kim’s framework,31 allowing us to evaluate the performance of the proposed method when encountering unidentifiability issues. In our simulation, we considered to generate five covariates X¯i=(Xi,1,Xi,2,Xi,3,Xi,4,Xi,5)superscriptsubscript¯𝑋𝑖topsubscript𝑋𝑖1subscript𝑋𝑖2subscript𝑋𝑖3subscript𝑋𝑖4subscript𝑋𝑖5\bar{X}_{i}^{\top}=(X_{i,1},X_{i,2},X_{i,3},X_{i,4},X_{i,5})over¯ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT = ( italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 5 end_POSTSUBSCRIPT ) from the normal distribution with

Xi,1𝒩(0,1)Xi,k+1Xi,k=xi,k𝒩(xi,k3,1),k=1,2,3,4.\begin{split}X_{i,1}&\sim\mathcal{N}\left(0,1\right)\\ X_{i,k+1}\mid X_{i,k}=x_{i,k}&\sim\mathcal{N}\left(\frac{-x_{i,k}}{3},1\right)% ,\quad k=1,2,3,4.\end{split}start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT end_CELL start_CELL ∼ caligraphic_N ( 0 , 1 ) end_CELL end_ROW start_ROW start_CELL italic_X start_POSTSUBSCRIPT italic_i , italic_k + 1 end_POSTSUBSCRIPT ∣ italic_X start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT end_CELL start_CELL ∼ caligraphic_N ( divide start_ARG - italic_x start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT end_ARG start_ARG 3 end_ARG , 1 ) , italic_k = 1 , 2 , 3 , 4 . end_CELL end_ROW

Here, {Xi,1,Xi,2,Xi,3,Xi,4}subscript𝑋𝑖1subscript𝑋𝑖2subscript𝑋𝑖3subscript𝑋𝑖4\{X_{i,1},X_{i,2},X_{i,3},X_{i,4}\}{ italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT } were regarded as measured covariates, while Xi,5subscript𝑋𝑖5X_{i,5}italic_X start_POSTSUBSCRIPT italic_i , 5 end_POSTSUBSCRIPT was regarded as an unmeasured confounder. By setting different a1[s]superscriptsubscript𝑎1delimited-[]𝑠a_{1}^{[s]}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT [ italic_s ] end_POSTSUPERSCRIPT in two scenarios (s=1,2)𝑠12(s=1,2)( italic_s = 1 , 2 ), where a1[1]=0.0775superscriptsubscript𝑎1delimited-[]10.0775a_{1}^{[1]}=0.0775italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT = 0.0775 and a1[2]=0.998superscriptsubscript𝑎1delimited-[]20.998a_{1}^{[2]}=0.998italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT [ 2 ] end_POSTSUPERSCRIPT = 0.998, the outcome was generated as follows:

μ(1)(x¯i)=a1[s]+0.4xi,1+0.4xi,2+0.6xi,1xi,2+0.5xi,30.7xi,4+0.2xi,5,superscript𝜇1subscript¯𝑥𝑖superscriptsubscript𝑎1delimited-[]𝑠0.4subscript𝑥𝑖10.4subscript𝑥𝑖20.6subscript𝑥𝑖1subscript𝑥𝑖20.5subscript𝑥𝑖30.7subscript𝑥𝑖40.2subscript𝑥𝑖5\mu^{(1)}(\bar{x}_{i})=a_{1}^{[s]}+0.4x_{i,1}+0.4x_{i,2}+0.6x_{i,1}x_{i,2}+0.5% x_{i,3}-0.7x_{i,4}+0.2x_{i,5},italic_μ start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ( over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT [ italic_s ] end_POSTSUPERSCRIPT + 0.4 italic_x start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT + 0.4 italic_x start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT + 0.6 italic_x start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT + 0.5 italic_x start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT - 0.7 italic_x start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT + 0.2 italic_x start_POSTSUBSCRIPT italic_i , 5 end_POSTSUBSCRIPT ,
μ(0)(x¯i)=0.0654+0.2xi,1+0.1xi,2+1.2xi,1xi,2+0.2xi,30.3xi,4+0.6xi,5,superscript𝜇0subscript¯𝑥𝑖0.06540.2subscript𝑥𝑖10.1subscript𝑥𝑖21.2subscript𝑥𝑖1subscript𝑥𝑖20.2subscript𝑥𝑖30.3subscript𝑥𝑖40.6subscript𝑥𝑖5\mu^{(0)}(\bar{x}_{i})=0.0654+0.2x_{i,1}+0.1x_{i,2}+1.2x_{i,1}x_{i,2}+0.2x_{i,% 3}-0.3x_{i,4}+0.6x_{i,5},italic_μ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = 0.0654 + 0.2 italic_x start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT + 0.1 italic_x start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT + 1.2 italic_x start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT + 0.2 italic_x start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT - 0.3 italic_x start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT + 0.6 italic_x start_POSTSUBSCRIPT italic_i , 5 end_POSTSUBSCRIPT ,
Yi(1)(X¯i=x¯i)𝒩(μ(1)(x¯i),14),similar-toconditionalsubscriptsuperscript𝑌1𝑖subscript¯𝑋𝑖subscript¯𝑥𝑖𝒩superscript𝜇1subscript¯𝑥𝑖14Y^{(1)}_{i}\mid(\bar{X}_{i}=\bar{x}_{i})\sim\mathcal{N}\left(\mu^{(1)}(\bar{x}% _{i}),\frac{1}{4}\right),italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∣ ( over¯ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∼ caligraphic_N ( italic_μ start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ( over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , divide start_ARG 1 end_ARG start_ARG 4 end_ARG ) ,
Yi(0)(X¯i=x¯i)𝒩(μ(0)(x¯i),14).similar-toconditionalsubscriptsuperscript𝑌0𝑖subscript¯𝑋𝑖subscript¯𝑥𝑖𝒩superscript𝜇0subscript¯𝑥𝑖14Y^{(0)}_{i}\mid(\bar{X}_{i}=\bar{x}_{i})\sim\mathcal{N}\left(\mu^{(0)}(\bar{x}% _{i}),\frac{1}{4}\right).italic_Y start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∣ ( over¯ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∼ caligraphic_N ( italic_μ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , divide start_ARG 1 end_ARG start_ARG 4 end_ARG ) .

The treatment assignment Zi{0,1}subscript𝑍𝑖01Z_{i}\in\{0,1\}italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ { 0 , 1 } was generated by the Bernoulli distribution with

P(Zi=1X¯i=x¯i,Yi(1)=yi(1))=11+exp(0.904+0.5xi,1+0.5xi,2+0.5xi,30.2xi,4xi,5+0.3yi(1)).P(Z_{i}=1\mid\bar{X}_{i}=\bar{x}_{i},Y^{(1)}_{i}=y^{(1)}_{i})=\frac{1}{1+\exp{% (-0.904+0.5x_{i,1}+0.5x_{i,2}+0.5x_{i,3}-0.2x_{i,4}-x_{i,5}+0.3y^{(1)}_{i})}}.italic_P ( italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 ∣ over¯ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG 1 + roman_exp ( - 0.904 + 0.5 italic_x start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT + 0.5 italic_x start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT + 0.5 italic_x start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT - 0.2 italic_x start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , 5 end_POSTSUBSCRIPT + 0.3 italic_y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG .

We simulated 1,000 observational studies with n=1,000𝑛1000n=1,000italic_n = 1 , 000 subjects for each scenario. For each simulated study, the bounds for the ATE were calculated by our proposed method, as well as the quantile balancing method.20 In applying the quantile balancing method, the linear quantile regression on {Xi,1,Xi,2,Xi,3,Xi,4}subscript𝑋𝑖1subscript𝑋𝑖2subscript𝑋𝑖3subscript𝑋𝑖4\{X_{i,1},X_{i,2},X_{i,3},X_{i,4}\}{ italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT } was applied. For the constraint (14), we estimated the MAR-based propensity score e^(Xi)^𝑒subscript𝑋𝑖\hat{e}(X_{i})over^ start_ARG italic_e end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) by the logistic regression model with {Xi,1,Xi,2,Xi,3,Xi,4}subscript𝑋𝑖1subscript𝑋𝑖2subscript𝑋𝑖3subscript𝑋𝑖4\{X_{i,1},X_{i,2},X_{i,3},X_{i,4}\}{ italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT } and applied 5-fold cross-fitting in the estimation of the quantile function. The application of the quantile balancing method was conducted by utilizing the R package provided by Dorn and Guo.20 The proposed method with (22) and (25) did not involve any estimation of the MAR-based propensity score. The OPTMODEL Procedure of SAS (SAS Institute Inc, Cary, North Carolina) was used for solving the linear programming problems in the proposed method. We considered four settings of the specification of g(X)𝑔𝑋g(X)italic_g ( italic_X ):

  1. D1

    includes 1111, and the linear and quadratic terms for all the observed covariates;

  2. D2

    includes D1 plus all two-variable interactions;

  3. D3

    includes D1 plus the cubic terms for all the observed covariates and all interactions;

  4. D4

    includes D3 plus the quartic and quintic terms for all the observed covariates, Xi,43(Xi,3Xi,2)(Xi,1+3Xi,2/2)superscriptsubscript𝑋𝑖43subscript𝑋𝑖3subscript𝑋𝑖2subscript𝑋𝑖13subscript𝑋𝑖22X_{i,4}^{3}(X_{i,3}-X_{i,2})(X_{i,1}+3X_{i,2}/2)italic_X start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT - italic_X start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ) ( italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT + 3 italic_X start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT / 2 ), and Xi,43(Xi,3Xi,2)/(Xi,1Xi,3)(Xi,1+3Xi,2/2)superscriptsubscript𝑋𝑖43subscript𝑋𝑖3subscript𝑋𝑖2subscript𝑋𝑖1subscript𝑋𝑖3subscript𝑋𝑖13subscript𝑋𝑖22X_{i,4}^{3}(X_{i,3}-X_{i,2})/(X_{i,1}-X_{i,3})(X_{i,1}+3X_{i,2}/2)italic_X start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT - italic_X start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ) / ( italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT - italic_X start_POSTSUBSCRIPT italic_i , 3 end_POSTSUBSCRIPT ) ( italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT + 3 italic_X start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT / 2 ).

In the proposed method, we considered to estimate the bounds for μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT under the constraints (23) and (24) and for μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT under the constraints (26) and (27). Thus, the validity of the proposed method would be dependent on the choice of δ𝛿\deltaitalic_δ. To discuss this point, we checked the distributions of the true propensity score in the simulated datasets. In the datasets under Scenario 1, with δ=0.1,0.01𝛿0.10.01\delta=0.1,0.01italic_δ = 0.1 , 0.01, and 0.0010.0010.0010.001, 18.39%, 0.29% and 0.01% true propensity scores did not satisfy the conditions (23) and (26), respectively. The corresponding proportions for the datasets under Scenario 2 were 2.47%, 0.01% and 0.00%, respectively. Thus, with δ=0.1𝛿0.1\delta=0.1italic_δ = 0.1, the constraints seemed not to hold, whereas setting δ=0.01𝛿0.01\delta=0.01italic_δ = 0.01 or less were relevant. Table 1 shows the average of the upper and the lower bounds and the coverage probability of the bounds for the ATE in the proposed method with several settings of δ𝛿\deltaitalic_δ and g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), where the coverage probability was defined as the proportion that the lower and the upper bounds covered the true ATE. The left panels of Figure 1 and Figure 2 show the boxplots of the lower and upper bounds for the ATE when δ𝛿\deltaitalic_δ was set to 0.01 in the two scenarios, respectively. We computed the averages of Y(1)superscript𝑌1Y^{(1)}italic_Y start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT and Y(0)superscript𝑌0Y^{(0)}italic_Y start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT over all simulated datasets and regarded their subtraction as the true ATE, which is depicted by a solid horizontal line in Figure 1 and Figure 2. In Scenario 1, as shown in Table 1 and the left panel of Figure 1, the proposed method demonstrated excellent performance in terms of the coverage probability when δ𝛿\deltaitalic_δ was set to 0.01 or smaller. For Scenario 2, as shown in Table 1 and the left panel of Figure 2, the coverage probability of the bounds was even excellent when δ𝛿\deltaitalic_δ was set to 0.1. In addition, the bounds (D2, D3, and D4) in Scenario 2 effectively excluded the null.

For the quantile balancing method, the right panels of Figure 1 and Figure 2 present the boxplots of the bound for the ATE based on different ORs in the two scenarios, respectively; the detailed estimates are summarized in Table 2. As the decrease of OR, the quantile balancing method gave a narrower bound with sacrificing the coverage probability. The proposed method provided feasible solutions for all the 1,000 simulated datasets with different settings of δ𝛿\deltaitalic_δ and g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), except for one setting (Scenario 2, D4, and δ=0.1𝛿0.1\delta=0.1italic_δ = 0.1), in which the coverage probability was almost 1 among 952 simulated datasets with feasible solutions and the bounds for ATE were the narrowest. In this case, the proposed method gave the worst-case bounds with an average length of 1.09, which was less than the length of the bound obtained by assuming OR to be 2 in the quantile balancing method. In words, by increasing the dimension of the function alone, we can narrow down the length of the bound obtained by the proposed method to the level of quantile balancing method with OR specified as 2. The results comply with our expectations that (1) the worst-case bound obtained by our proposal can cover the true ATE without any additional assumptions; and (2) by increasing the dimension of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), narrowing of our bound can be achieved.

6 Application

In this section, we apply the proposed method to a real-world data from TONE study.32 This study aimed to evaluate the effectiveness of a designated exercise program in preventing dementia among the elderly. In this study, scores in five cognitive domains (attention, memory, visuospatial function, language, and reasoning) were used to quantify the level of cognition. We considered to estimate the effectiveness of the exercise program on the attention domain, which was regarded as a continuous variable. The confounders included age, sex, education level (1/0101/01 / 0: high/low), and attention scores at the baseline.

In the primary analysis, a total of 935 participants were included, in which 234 were in the exercise program group and 701 in the control group. We utilized the IPW estimator to adjust imbalances in covariates between the exercise program and control groups. In the primary analysis, we used the logistic regression to the estimate MAR-based propensity scores, and the unknown parameters were estimated by the maximum likelihood method. Significantly large between-group imbalances were observed in age, attention scores at baseline, and education level before weighting inversely by the estimated MAR-based propensity scores. The between-group imbalances were effectively eliminated after weighting, indicating that IPW significantly enhanced balance across the two groups. The IPW point estimate of the ATE was 4.09 with a 95% confidence interval of [2.97,5.22]2.975.22[2.97,5.22][ 2.97 , 5.22 ]. Therefore, the result of primary analysis indicated a significantly positive effect of the exercise program on the improvement of the attention level. This finding was consistent with some previous randomized controlled trials.33, 34 However, a meta-analysis of observational studies35 reported an insignificantly positive result, suggesting that the robustness of the positive effects needs further confirmation. Then, a sensitivity analysis was necessary. In the sensitivity analysis, we applied our proposed method with δ=0.01𝛿0.01\delta=0.01italic_δ = 0.01 and four settings of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ):

  1. D1

    includes 1111 and the linear term of all the covariates;

  2. D2

    includes 1111 and the linear and quadratic terms of all the covariates;

  3. D3

    includes D2 plus the interaction between age and attention score at baseline;

  4. D4

    includes D2 plus all two-variable interactions.

For the quantile balancing method, the quantile function was estimated using linear quantile regression with 5-fold cross fitting, and the MAR-based propensity scores were estimated using the logistic regression. The result by the proposed method is given in Table 3. In addition to the plain bounds, we calculated the confidence intervals of the upper and lower bounds with 1,000 bootstrap samples. BootLower and BootUpper refer to the lower and upper bounds of 95% bootstrap confidence interval for the lower and upper bounds of ATE, respectively. Feasibility refers to the number of the resampled datasets, in which feasible solutions of the linear programming in the proposed method can be obtained. At first, we examined the worst-case bounds based on the proposed method (22) without the OR-based constraints (Table 3). The resulting bounds with the four settings of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) are presented in the row with λ=/𝜆\lambda=/italic_λ = / in Table 3. Even with g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) of a higher dimension (D4), the lower bound was less than the null value 0, indicating that the proposed methods did not eliminate concerns on unmeasured confounders. Since the length of the bounds was wide and not necessarily well interpretable, the OR-based constrains were added, and the bounds were calculated with (29) and (31). The results with λ=2,3𝜆23\lambda=2,3italic_λ = 2 , 3, and 5555 are also shown in Table 3. Our proposal indicated that the worst-case bound could exclude the null if it was supposed that the OR between the true propensity score and estimated MAR-based propensity score was at most 2. For reference, we also applied the quantile balancing method. The corresponding bounds with the OR-based constrains based on the quantile balancing method (10) are presented in Table 4. The bounds were similar to ours and, of note, when subjected to the same OR-based constraint, our bound was tighter than that of the quantile balancing method. We noticed that the smaller OR and greater complexity of g(Xi)𝑔subscript𝑋𝑖g(X_{i})italic_g ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) caused less feasible solutions of the linear programming in resampled datasets. Without the additional OR=based constraint, the results of bootstrap kept stable and feasible. However, when a small OR was assumed, occasions to have feasible solutions in resampled datasets drastically decreased. Thus, the estimating equation constraints are not necessarily compatible with the OR-based constraints, in particular when a small λ𝜆\lambdaitalic_λ is set.

By incorporating more covariates, we tried to make the bounds tighter. We further included the baseline scores of other four cognitive domains (memory, visuospatial function, language, and reasoning) as confounders in the sensitivity analysis. The results by the proposed method without the OR-based constraint (22) are shown in Table 5. The worst-case bounds for the ATE in Table 5 achieve great tightness. In some settings (D3 and D4), the worst-case bounds even excluded the null, thus indicating the robustness of the primary analysis, without any additional OR-based constraints. Nevertheless, we noticed small numbers of boostrap samples with feasible solutions. Thus, the successful exclusion of the null with D3 and D4 would be subject to instability, and it could not eliminate concerns against unmeasured confounders completely.

7 Discussion

Interest in drawing medical evidence from the real-world data has been rapidly growing, and the number of papers reporting results of real-world data analyses with the confounder adjustment has been substantially increasing. The propensity score analysis is now routinely applied in the analysis of observational studies. However, almost all the papers only reports the results of the propensity score matching and/or the IPW method by the propensity score and do not address the important issue of the unmeasured confounders. Since the issue of residual confounding is always left as a limitation in the analysis of observational studies, it is very important to develop sensitivity analysis methods which are easily applicable and rely on less assumptions. In this paper, we proposed a simple sensitivity analysis method based on the IPW method. To our best knowledge, all the existing sensitivity analysis methods for the IPW estimator rely on some untestable assumptions on the departure from the SITA assumption. Although they provide very useful tools to address potential impacts of unmeasured confounders, there are still concerns with potential violation of the assumption. Our method requires only minimal assumptions and can construct the bounds for the ATE, completely free from any quantification of the departure from the SITA assumption. Although it may not give sufficiently informative bounds of small width, showing the bounds based on minimal assumptions would be useful as a basis in addressing the potential impacts of the residual confounding. Our method can easily incorporate the OR-based constraints for the departure from the SITA assumption to give tighter bounds. The resulting method with the additional constraints corresponds to the quantile methods by Dorn and Gou.20 Our method can be easily applied with the linear programming, avoiding the estimation of the quantile functions, and empirically, gave a bit tighter bounds in our simulation study. We propose a strategy that analysts begin with the bounds without the OR-based constraints of minimal assumptions and then incorporates additional OR-based constraints if needed. Comparing with the elegant theory of the quantile balancing method by Dorn and Guo,20 our method is based on a very simple idea. We believe that our method is practical and our strategy would be useful in addressing the issue of residual confounding.

By incorporating more constraints with g(X)𝑔𝑋g(X)italic_g ( italic_X ) of higher dimension, one may have tighter bounds with our proposed method. It motivates us to collect as many potential confounders as possible in conducting observational studies. On the other hand, we do not have any clear guidance on how to define g(X)𝑔𝑋g(X)italic_g ( italic_X ). Putting more constraints would be desirable to make the bounds tighter. However, feasibility depends on sample size, and we cannot implement the linear programming without any feasible solution which depends on the choice of g(X)𝑔𝑋g(X)italic_g ( italic_X ). Establishing a practical guidance would be an important future research topic.

Our idea is simple: we remove any parametric models for the propensity score, but still rely on the estimating equation for the propensity score. This simplicity would make us easily extend the idea to more complicated problems in causal inference and missing data analysis. This also warrants addressing in future research.

Acknowledgments

This research was partly supported by Grant-in-Aid for Challenging Exploratory Research (16K12403) and for Scientific Research(16H06299, 18H03208) from the Ministry of Education, Science, Sports and Technology of Japan.

Conflict of interest

The authors declare no conflict of interests.

References

  • 1 Rubin DB. Bayesian inference for causal effects: The role of randomization. Ann Stat. 1978;6(1):34–58.
  • 2 Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55.
  • 3 Hirano K, Imbens GW. Estimation of causal effects using propensity score weighting: An application to data on right heart catheterization. Health Serv Outcomes Res Methodol. 2001;2:259–278.
  • 4 Austin PC, Stuart EA. Moving towards best practice when using inverse probability of treatment weighting (IPTW) using the propensity score to estimate causal treatment effects in observational studies. Stat Med. 2015;34(28):3661–3679.
  • 5 Lunceford JK, Davidian M. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. Stat Med. 2004;23(19):2937–2960.
  • 6 Hernán MA, Hernández-Díaz S, Robins JM. A structural approach to selection bias. Epidemiology. 2004;15(5):615–625.
  • 7 Petersen ML, Laan v. dMJ. Causal models and learning from data: integrating causal modeling and statistical estimation. Epidemiology (Cambridge, Mass.). 2014;25(3):418.
  • 8 Scharfstein DO, Rotnitzky A, Robins JM. Adjusting for nonignorable drop-out using semiparametric nonresponse models. J Am Stat Assoc. 1999;94(448):1096–1120.
  • 9 Miao W, Ding P, Geng Z. Identifiability of normal and normal mixture models with nonignorable missing data. J Am Stat Assoc. 2016;111(516):1673–1683.
  • 10 Rosenbaum PR, Rubin DB. Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome. J R Stat Soc Series B Stat Methodol. 1983;45(2):212–218.
  • 11 Lin DY, Psaty BM, Kronmal RA. Assessing the sensitivity of regression results to unmeasured confounders in observational studies. Biometrics. 1998;54(3):948–963.
  • 12 Cornfield J, Haenszel W, Hammond EC, Lilienfeld AM, Shimkin MB, Wynder EL. Smoking and lung cancer: recent evidence and a discussion of some questions. J Natl Cancer Inst. 1959;22(1):173–203.
  • 13 Ding P, VanderWeele TJ. Sensitivity analysis without assumptions. Epidemiology (Cambridge, Mass.). 2016;27(3):368–377.
  • 14 VanderWeele TJ, Ding P. Sensitivity analysis in observational research: introducing the E-value. Ann Intern Med. 2017;167(4):268–274.
  • 15 Li L, Shen C, Wu AC, Li X. Propensity score-based sensitivity analysis method for uncontrolled confounding. Am J Epidemiol. 2011;174(3):345–353.
  • 16 Shen C, Li X, Li L, Were MC. Sensitivity analysis for causal inference using inverse probability weighting. Biom J. 2011;53(5):822–837.
  • 17 Lu S, Ding P. Flexible sensitivity analysis for causal inference in observational studies subject to unmeasured confounding. arXiv. Preprint posted online Sun, 28 May 2023. doi: https://doi.org/10.48550/arXiv.2305.1764
  • 18 Zhao Q, Small DS, Bhattacharya BB. Sensitivity analysis for inverse probability weighting estimators via the percentile bootstrap. J R Stat Soc Series B Stat Methodol. 2019;81(4):735–761.
  • 19 Tan Z. A distributional approach for causal inference using propensity scores. J Am Stat Assoc. 2006;101(476):1619–1637.
  • 20 Dorn J, Guo K. Sharp Sensitivity Analysis for Inverse Propensity Weighting via Quantile Balancing. J Am Stat Assoc. Published online: 31 May 2022. doi: 10.1080/01621459.2022.2069572
  • 21 Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. J Educ Psychol. 1974;66(5):688–701.
  • 22 Robins JM, Rotnitzky A, Zhao LP. Estimation of regression coefficients when some regressors are not always observed. J Am Stat Assoc. 1994;89(427):846–866.
  • 23 Greenlees JS, Reece WS, Zieschang KD. Imputation of missing values when the probability of response depends on the variable being imputed. J Am Stat Assoc. 1982;77(378):251–261.
  • 24 Andrea R, Scharfstein D, Su TL, Robins J. Methods for conducting sensitivity analysis of trials with potentially nonignorable competing causes of censoring. Biometrics. 2001;57(1):103–113.
  • 25 Verbeke G, Molenberghs G, Thijs H, Lesaffre E, Kenward MG. Sensitivity analysis for nonrandom dropout: a local influence approach. Biometrics. 2001;57(1):7–14.
  • 26 Robins JM. Association, causation, and marginal structural models. Synthese. 1999;121(1/2):151–179.
  • 27 Robins JM, Rotnitzky A, Scharfstein DO. Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models. In: Halloran ME, Berry D. , eds. Statistical models in epidemiology, the environment, and clinical trials, New York, NY: Springer, 2000:1–94.
  • 28 Brumback BA, Hernán MA, Haneuse SJ, Robins JM. Sensitivity analyses for unmeasured confounding assuming a marginal structural model for repeated measures. Stat Med. 2004;23(5):749–767.
  • 29 Tan Z. Bounded, efficient and doubly robust estimation with inverse weighting. Biometrika. 2010;97(3):661–682.
  • 30 Robins J, Sued M, Lei-Gomez Q, Rotnitzky A. Comment: Performance of double-robust estimators when" inverse probability" weights are highly variable. Stat Sci. 2007;22(4):544–559.
  • 31 Morikawa K, Kim JK. Semiparametric optimal estimation with nonignorable nonresponse data. Ann Stat. 2021;49(5):2991–3014.
  • 32 Sasaki M, Kodama C, Hidaka S, et al. Prevalence of four subtypes of mild cognitive impairment and APOE in a Japanese community. Int J Geriatr Psychiatry. 2009;24(10):1119–1126.
  • 33 Sanders L, Hortobágyi T, Karssemeijer E, Zee V. dE, Scherder E, Van Heuvelen M. Effects of low-and high-intensity physical exercise on physical and cognitive function in older persons with dementia: a randomized controlled trial. Alzheimers Res Ther. 2020;12(1):28.
  • 34 Lamb SE, Sheehan B, Atherton N, et al. Dementia And Physical Activity (DAPA) trial of moderate to high intensity exercise training for people with dementia: randomised controlled trial. BMJ. 2018;361:k1675. doi: 10.1136/bmj.k1675
  • 35 Demurtas J, Schoene D, Torbahn G, et al. Physical activity and exercise in mild cognitive impairment and dementia: an umbrella review of intervention and observational studies. J Am Med Dir Assoc. 2020;21(10):1415–1422.
Table 1: Summary of the proposed method with several settings of δ𝛿\deltaitalic_δ and g(X)𝑔𝑋g(X)italic_g ( italic_X ) over 1,000 simulated datasets in two scenarios.
Scenario 1 (True ATE: 0.21) Scenario 2 (True ATE: 1.13)
g(X)𝑔𝑋g(X)italic_g ( italic_X ) δ𝛿\deltaitalic_δ Bound*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT Length**absent{}^{**}start_FLOATSUPERSCRIPT * * end_FLOATSUPERSCRIPT Coverage***absent{}^{***}start_FLOATSUPERSCRIPT * * * end_FLOATSUPERSCRIPT Bound Length Coverage
D1 0.1a𝑎{}^{a}start_FLOATSUPERSCRIPT italic_a end_FLOATSUPERSCRIPT [-0.44,1.41] 1.85 1.00 [0.32,2.41] 2.09 1.00
0.01b𝑏{}^{b}start_FLOATSUPERSCRIPT italic_b end_FLOATSUPERSCRIPT [-1.36,2.12] 3.48 1.00 [-0.60,3.09] 3.68 1.00
0.001c𝑐{}^{c}start_FLOATSUPERSCRIPT italic_c end_FLOATSUPERSCRIPT [-1.43,2.18] 3.61 1.00 [-0.93,3.18] 4.11 1.00
D2 0.1 [0.10,1.08] 0.97 0.93 [0.90,2.07] 1.17 1.00
0.01 [-0.38,1.56] 1.94 1.00 [0.32,2.53] 2.21 1.00
0.001 [-0.41,1.59] 2.00 1.00 [0.05,2.57] 2.53 1.00
D3 0.1 [0.14,1.04] 0.91 0.69 [0.91,2.05] 1.14 1.00
0.01 [-0.33,1.50] 1.83 1.00 [0.34,2.47] 2.13 1.00
0.001 [-0.35,1.53] 1.88 1.00 [0.09,2.50] 2.41 1.00
D4 0.1 [0.21,0.97] 0.75 0.45 [0.93,2.02] 1.09 0.99
0.01 [-0.19,1.37] 1.56 0.95 [0.36,2.43] 2.07 1.00
0.001 [-0.21,1.38] 1.59 0.97 [0.13,2.45] 2.32 1.00
  • *

    the averages of the lower and upper bounds;

  • **

    the difference between the averages of the lower and upper bounds;

  • ***

    the proportion of inclusion of the true ATE between the lower and upper bounds;

  • a

    18.39% and 2.47% of the true propensity scores did not satisfy the constraints (23) or (26), respectively, in scenarios 1 and 2;

  • b

    0.29% and 0.01% of the true propensity scores did not satisfy the constraints (23) or (26), respectively, in scenarios 1 and 2;

  • c

    0.01% and 0.00% of the true propensity scores did not satisfy the constraints (23) or (26), respectively, in scenarios 1 and 2.

Table 2: Summary of the quantile balancing method with several settings of OR over 1,000 simulated datasets in two scenarios.
Scenario 1 (True ATE: 0.21) Scenario 2 (True ATE: 1.13)
λ𝜆\lambdaitalic_λ Bound*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT Length**absent{}^{**}start_FLOATSUPERSCRIPT * * end_FLOATSUPERSCRIPT Coverage***absent{}^{***}start_FLOATSUPERSCRIPT * * * end_FLOATSUPERSCRIPT Bound Length Coverage
1 [0.01,0.01] / / [1.31,1.31] / /
1.2 [-0.13,0.14] 0.27 0.48 [1.16,1.47] 0.31 0.36
1.5 [-0.29,0.31] 0.61 0.89 [0.98,1.66] 0.68 0.96
2 [-0.51,0.54] 1.04 1.00 [0.74,1.90] 1.17 1.00
3 [-0.81,0.87] 1.68 1.00 [0.40,2.26] 1.85 1.00
5 [-1.22,1.35] 2.57 1.00 [-0.03,2.74] 2.77 1.00
  • *

    the averages of the lower and upper bounds;

  • **

    the difference between the averages of the lower and upper bounds;

  • ***

    the proportion of inclusion of the true ATE between the lower and upper bounds.

Table 3: Bounds of the ATE for the attention score in TONE study by the proposed method.
g(X)𝑔𝑋g(X)italic_g ( italic_X ) λ𝜆\lambdaitalic_λ Lower bound Upper bound Length BootLower*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT BootUpper*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT Feasibility**absent{}^{**}start_FLOATSUPERSCRIPT * * end_FLOATSUPERSCRIPT
D1 /***absent{}^{***}start_FLOATSUPERSCRIPT * * * end_FLOATSUPERSCRIPT -10.57 18.30 28.87 -11.50 19.27 1000
2 0.57 6.61 6.04 -0.36 7.59 987
3 -1.18 8.59 9.77 -2.39 9.48 1000
5 -3.36 10.77 14.14 -4.69 11.72 1000
D2 / -9.81 17.75 27.56 -10.83 18.39 1000
2 1.66 5.50 3.84 0.05 6.93 386
3 -0.92 8.18 9.10 -1.82 8.92 892
5 -3.19 10.38 13.57 -4.12 11.12 997
D3 / -9.70 17.23 26.93 -10.46 17.49 1000
2 / / / 0.42 6.73 66
3 -0.40 7.45 7.85 -1.41 8.43 593
5 -3.08 9.91 13.00 -3.67 10.59 957
D4 / -8.74 16.54 25.28 -8.26 16.17 991
2 / / / 0.85 6.39 55
3 -0.25 7.16 7.41 -0.86 7.85 540
5 -2.66 9.48 12.15 -2.92 9.93 920
  • *

    the lower and upper bounds of 95% bootstrap confidence interval for the lower and upper bounds of ATE, respectively;

  • **

    the number of the resampled datasets in the proposed method, in which feasible solution of the linear programming can be obtained;

  • ***

    the rows with /// in the λ𝜆\lambdaitalic_λ column show the results without OR-based constraint.

Table 4: Bounds of the ATE for the attention score in TONE study by the quantile balancing method.
λ𝜆\lambdaitalic_λ Lower bound Upper bound Length BootLower*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT BootUpper*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT
1 4.09 4.09 / 3.04 5.30
1.2 3.29 4.92 1.63 2.21 6.14
1.5 2.32 5.97 3.64 1.20 7.21
2 1.13 7.29 6.17 -0.01 8.59
3 -0.61 9.15 9.77 -1.85 10.58
5 -2.80 11.56 14.36 -4.25 13.27
  • *

    the lower and upper bounds of 95% bootstrap confidence interval for the lower and upper bounds of ATE, respectively.

Table 5: Bounds of the ATE for the attention score in TONE study by the proposed method with g(X)𝑔𝑋g(X)italic_g ( italic_X ) of additional four domains.
g(X)𝑔𝑋g(X)italic_g ( italic_X ) Lower bound Upper bound Length BootLower*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT BootUpper*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT Feasibility**absent{}^{**}start_FLOATSUPERSCRIPT * * end_FLOATSUPERSCRIPT
D1 -6.75 14.93 21.68 -8.84 15.84 1000
D2 -2.08 11.14 13.22 -4.29 12.44 657
D3 0.31 8.32 8.01 -3.14 11.33 323
D4 0.70 7.82 7.12 -2.45 10.51 220
  • *

    the lower and upper bounds of 95% bootstrap confidence interval for the lower and upper bounds of ATE, respectively;

  • **

    the number of the resampled datasets in the proposed method, in which feasible solution of the linear programming can be obtained.

Refer to caption
(a) The proposed method (δ=0.01𝛿0.01\delta=0.01italic_δ = 0.01)
Refer to caption
(b) The quantile balancing method
Figure 1: Boxplots of the lower and upper bounds for ATE obtained by the proposed method (left panel) and the quantile balancing method (right panel) over 1,000 simulated datasets in Scenario 1
Refer to caption
(a) The proposed method (δ=0.01𝛿0.01\delta=0.01italic_δ = 0.01)
Refer to caption
(b) The quantile balancing method
Figure 2: Boxplots of the lower and upper bounds for ATE obtained by the proposed method (left panel) and the quantile balancing method (right panel) over 1,000 simulated datasets in Scenario 2
==" alt="[LOGO]">