High-dimensional causal discovery under non-Gaussianity

Y. Samuel Wang, Mathias Drton

Research output: Contribution to journalArticlepeer-review

27 Scopus citations

Abstract

We consider graphical models based on a recursive system of linear structural equations. This implies that there is an ordering, σ, of the variables such that each observed variable Yv is a linear function of a variable-specific error term and the other observed variables Yu with σ(u) < σ (v). The causal relationships, i.e., which other variables the linear functions depend on, can be described using a directed graph. It has previously been shownthat when the variable-specific error terms are non-Gaussian, the exact causal graph, as opposed to a Markov equivalence class, can be consistently estimated from observational data. We propose an algorithm that yields consistent estimates of the graph also in high-dimensional settings in which the number of variables may grow at a faster rate than the number of observations, but in which the underlying causal structure features suitable sparsity; specifically, the maximum in-degree of the graph is controlled. Our theoretical analysis is couched in the setting of log-concave error distributions.

Original languageEnglish
Pages (from-to)41-59
Number of pages19
JournalBiometrika
Volume107
Issue number1
DOIs
StatePublished - 1 Mar 2020

Keywords

  • Causal discovery
  • Directed graphical model
  • High-dimensional statistics
  • Non-Gaussian data
  • Structural equation model

Fingerprint

Dive into the research topics of 'High-dimensional causal discovery under non-Gaussianity'. Together they form a unique fingerprint.

Cite this