
Theory of g-computation
Core Clinical Sciences
transportGCMath.Rmd
Let be the outcome of interest, be the (binary, coded as 0 for control and 1 for treatment) treatment, be the indicator for study participation (so that means that the subject is in the population of the original study, while means that the subject is in the target population), be covariates to control for confounding in the original study and be effect modifiers. Let and be counterfactual outcomes associated with control and treatment, respectively. The primary objective of transportability analysis is to estimate the ATE in the target population:
Simply taking the difference in sample means using the original study data will only unbiasedly estimate the quantity which is different from the target ATE due to confounding and the different distributions of effect modifiers.
Under the assumptions specified in the Assumptions vignette, we have
Therefore, we can estimate unbiasedly by averaging samples of over the distribution of and in the target data. We can obtain such samples by first fitting a regression model of in terms of , and , which essentially fits a model of . Then, this model is used to calculate fitted values of at observed values of and in the target data with being set to . The fitted values may be seen as βsamplesβ of where and are empirically drawn from the target data, so they are averaged to obtain an estimate of the counterfactual mean above. From this, the target ATE may be estimated unbiasedly, provided that the outcome model is specified correctly.
For more information, check out the βWhat Ifβ book on causal inference (HernΓ‘n and Robins 2024), which includes a discussion of g-computation for confounding adjustment.