
Theory of target aggregate data adjustment transportability analysis
Core Clinical Sciences
transportTADAMath.Rmd
Let be the outcome of interest, be the binary treatment (coded as 0 for control and 1 for treatment). be the indicator for study participation (so that means that the subject is in the population of the original study, while means that the subject is in the target population). be covariates to control for confounding in the original study and be effect modifiers. Let and be counterfactual outcomes associated with control and treatment, respectively. The primary objective of transportability analysis is to estimate the average treatment effect (ATE) in the target population:
Simply taking the difference in sample means using the original study data will only unbiasedly estimate the quantity which is different from the target ATE due to confounding and the different distributions of effect modifiers.
Let and To control for confounding, the estimator will unbiasedly estimate the quantity which uses the first set of weights and is the IP weighting approach in causal inference. However, to estimate the target ATE, the estimator should be used instead, which incorporates the second set of weights to unbiasedly estimate the target ATE. This is extended to estimate the coefficients of any marginal structural model in the target population in the same manner as IP weighting: more specifically, the marginal structural model coefficients are estimated by fitting regression models on the original study data with the weights .
Assume that there is individual patient-level data (IPD) available for the study data, but only aggregate-level data (AgD) is available for the target data. The first set of weights, can be obtained using logistic regression on the study data. However, this is not possible for , so we use a method of moments (MoM) approach to obtain it instead. Let denote effect modifiers that we wish to adjust for. Let be the value of the effect modifiers for each individual patient in study sample. Let be the mean of in the target sample.
We assume that the weight for individual patient is These weights implicitly mimic the inverse odds of participation weighting approach when IPD for target data is available. Instead of logistic regression, the coefficients and in the weight will be estimated to satisfy the equality constraint:
Once is obtained, the remainder of the analysis proceeds as for inverse odds of participation weighting. For more information, check out the βWhat Ifβ book on causal inference (HernΓ‘n and Robins 2024) and the introduction of method of moments by Phillippo et al. (Phillippo et al. 2018) originally in indirect treatment comparison (ITC) application.