Causal mediation analysis with double machine learning

Helmut Farbmacher, Martin Huber, Lukáš Laffers, Henrika Langen, Martin Spindler

Research output: Contribution to journalArticlepeer-review

28 Scopus citations

Abstract

This paper combines causal mediation analysis with double machine learning for a data-driven control of observed confounders in a high-dimensional setting. The average indirect effect of a binary treatment and the unmediated direct effect are estimated based on efficient score functions, which are robust with respect to misspecifications of the outcome, mediator, and treatment models. This property is key for selecting these models by double machine learning, which is combined with data splitting to prevent overfitting. We demonstrate that the effect estimators are asymptotically normal and n-1/2-consistent under specific regularity conditions and investigate the finite sample properties of the suggested methods in a simulation study when considering lasso as machine learner. We also provide an empirical application to the US National Longitudinal Survey of Youth, assessing the indirect effect of health insurance coverage on general health operating via routine checkups as mediator, as well as the direct effect.

Original languageEnglish
Pages (from-to)277-300
Number of pages24
JournalEconometrics Journal
Volume25
Issue number2
DOIs
StatePublished - 1 May 2022

Keywords

  • causal mechanisms
  • direct and indirect effects
  • double machine
  • efficient score
  • learning
  • mediation

Fingerprint

Dive into the research topics of 'Causal mediation analysis with double machine learning'. Together they form a unique fingerprint.

Cite this