Identification of in-sample positivity violations using regression trees: The PoRT algorithm

Open archive

Danelian, Gabriel | Foucher, Yohann | Léger, Maxime | Le Borgne, F | Chatton, Arthur

Published by CCSD; De Gruyter

International audience.

Background: The positivity assumption is crucial when drawing causal inferences from observational studies, but it is often overlooked in practice. A violation of positivity occurs when the sample contains a subgroup of individuals with an extreme relative frequency of experiencing one of the levels of exposure. To correctly estimate the causal effect, we must identify such individuals. For this purpose, we suggest a regression tree-based algorithm.

Development: Based on a succession of regression trees, the algorithm searches for combinations of covariate levels that result in subgroups of individuals with a low relative frequency of exposed (or unexposed) individuals.

Application: We applied the algorithm by reanalyzing four recently published medical studies. We identified the two positivity violations reported by the authors. In addition, we identified ten subgroups with a suspected violation.

Conclusions: The PoRT algorithm helps to detect in-sample positivity violations in causal studies. We implemented the algorithm in the R package RISCA to facilitate its use.
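To make the idea of an in-sample positivity violation concrete, here is a minimal Python sketch. It is not the PoRT algorithm and not the RISCA implementation: instead of a succession of regression trees, it simply enumerates every combination of covariate levels (up to two covariates at a time) and flags subgroups whose exposed proportion is extreme. The function name `flag_subgroups`, the parameters `threshold`, `min_share`, and `max_vars`, and the toy data are all assumptions made up for illustration.

```python
from itertools import combinations


def flag_subgroups(rows, exposure, covariates,
                   threshold=0.05, min_share=0.10, max_vars=2):
    """Flag subgroups with an extreme exposed proportion.

    Hypothetical helper: exhaustive enumeration, not regression trees.
    A subgroup is flagged when it holds at least `min_share` of the sample
    and its exposed proportion is <= threshold or >= 1 - threshold.
    """
    n = len(rows)
    flagged = []
    for k in range(1, max_vars + 1):
        for names in combinations(covariates, k):
            # Group exposure values by joint levels of the chosen covariates.
            groups = {}
            for row in rows:
                key = tuple(row[name] for name in names)
                groups.setdefault(key, []).append(row[exposure])
            for levels, values in groups.items():
                share = len(values) / n
                p_exposed = sum(values) / len(values)
                extreme = p_exposed <= threshold or p_exposed >= 1 - threshold
                if share >= min_share and extreme:
                    flagged.append((dict(zip(names, levels)),
                                    len(values), p_exposed))
    return flagged


def make_rows(sex, age_group, exposures):
    # Build toy records; 1 = exposed, 0 = unexposed.
    return [{"sex": sex, "age_group": age_group, "exposed": e}
            for e in exposures]


# Made-up sample of 20 individuals: no older woman is exposed.
rows = (
    make_rows("F", "old", [0, 0, 0, 0, 0, 0])
    + make_rows("F", "young", [1, 1, 0, 0])
    + make_rows("M", "old", [1, 0, 1, 0, 1])
    + make_rows("M", "young", [1, 0, 0, 1, 0])
)

flagged = flag_subgroups(rows, "exposed", ["sex", "age_group"])
for definition, size, p_exposed in flagged:
    print(definition, size, p_exposed)
```

In this toy sample, neither covariate alone isolates an extreme subgroup; only the combination of the two does, which is why a search over combinations of covariate levels is needed. Exhaustive enumeration grows quickly with the number of covariates, which is one plausible motivation for a tree-based search.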

Suggestions

By the same author

G-computation, propensity score-based methods, and targeted maximum likelihood estimator for causal inference with different covariates sets: a comparative simulation study

Open archive | Chatton, Arthur | CCSD

International audience. Controlling for confounding bias is crucial in causal inference. Distinct methods are currently employed to mitigate the effects of confounding bias. Each requires the introduction of a set o...

Causal inference in case of near‐violation of positivity: comparison of methods

Open archive | Léger, Maxime | CCSD

International audience. In causal studies, the near‐violation of the positivity may occur by chance, because of sample‐to‐sample fluctuation despite the theoretical veracity of the positivity assumption in the popul...

G-computation and machine learning for estimating the causal effects of binary exposure statuses on binary outcomes

Open archive | Le Borgne, F | CCSD

International audience. In clinical research, there is a growing interest in the use of propensity score-based methods to estimate causal effects. G-computation is an alternative because of its high statist...
