arxiv_cs_lg 2026年4月20日

境界干渉データを用いて全変数の連合介入分布の推定

Estimating Joint Interventional Distributions from Marginal Interventional Data

Translated: 2026/4/20 11:06:06

machine-learningcausal-inferencemaximum-entropyinterventional-dataprobability-distribution

Japanese Translation

arXiv:2409.01794v2 発表タイプ：交差検証の置換本稿では、最大エントロピー原理を用いて境界介入データを活用し、全変数の連合条件付き分布を取得する方法を示す。これに応じて、観測データに加えて介入データも利用する因果的マックスエントロピー手法を拡張した。ラグランジュ対偶性を活用し、介入制約が課せられた因果的マックスエントロピー問題の解が、マックスエントロピーの解と同様に指数族にあることを証明した。我々の手法は、任意の変数の部分集合に対して境界介入分布が与えられた際に以下の二つの興味深いタスクを実行することを可能にする。まず、観測データと単変数の介入データという混合物から因果特徴選択を実行する方法を示し、次いで、連合介入分布を推測する方法を示した。前者のタスクについては、合成生成データ上で、我々の提案手法がデータセットの統合において最先端の手法を上回り、すべての変数の連合観測にアクセスが必要である KCI 検定と同等の結果を得ることができたと証明した。

Original Content

arXiv:2409.01794v2 Announce Type: replace-cross Abstract: In this paper we show how to exploit interventional data to acquire the joint conditional distribution of all the variables using the Maximum Entropy principle. To this end, we extend the Causal Maximum Entropy method to make use of interventional data in addition to observational data. Using Lagrange duality, we prove that the solution to the Causal Maximum Entropy problem with interventional constraints lies in the exponential family, as in the Maximum Entropy solution. Our method allows us to perform two tasks of interest when marginal interventional distributions are provided for any subset of the variables. First, we show how to perform causal feature selection from a mixture of observational and single-variable interventional data, and, second, how to infer joint interventional distributions. For the former task, we show on synthetically generated data, that our proposed method outperforms the state-of-the-art method on merging datasets, and yields comparable results to the KCI-test which requires access to joint observations of all variables.