arxiv_cs_lg 2026年4月24日

黒箱モデルにおける主効果の推定に適用された累積集合 D-最良設計

Accumulated Aggregated D-Optimal Designs for Estimating Main Effects in Black-Box Models

Translated: 2026/4/24 20:12:56

black-box-modelsexplainable-mlexperimental-designd-optimal-designsmain-effect-estimation

Japanese Translation

arXiv:2510.08465v2 Announce Type: replace-cross 要旨：個々の入力変数が黒箱モデルの出力に与える影響を推定することは、可視性機械学習における中心的なタスクです。しかし、既存の方法は、2 つの主要な制約に苦しんでおり、それは、データ多様体が遠く離れた所での問い合わせ点を配置した場合に発生する分割外（OOD）評価への感受性、および実践において信頼性の低い効果推定を招く可能性がある特徴相関に対する不安定性です。我々は、主効果の推定を設計問題として統一した見方を提案し、既存のすべての方法が評価位置の選択のみで異なることを示しました。この形式に基づき、累積集合 D-最良設計に基づく推定器である A2D2E（Accumulated Aggregated D-Optimal Designs）を提案します。A2D2E は、変数の分散を最小限に抑えるために D-最良ハイパー立方体設計を用いて評価を置き換えます。A2D2E はモデル非依存であり、予測関数の微分は不要であり、既存の手法と比較した複雑性を持つ閉形式の推定器を受容します。我々は、A2D2E が ALE と同じ人口目標に整合性があることを確立し、代わりに代用モデルが利用される現実的な設定もこの結果を拡張しました。複数の予測モデルと依存関係設定に対する大規模なシミュレーションを通じて、A2D2E が ALE ベースの方法よりも優れていることを示しました。特に、高特徴相関の条件下で大きな改善が得られました。

Original Content

arXiv:2510.08465v2 Announce Type: replace-cross Abstract: Estimating how individual input variables affect the output of a black-box model is a central task in explainable machine learning. However, existing methods suffer from two key limitations: sensitivity to out-of-distribution (OOD) evaluations, which arises when query points are placed far from the data manifold, and instability under feature correlation, which can lead to unreliable effect estimates in practice. We introduce a unified view of main effect estimation as a design problem, which reveals that all existing methods differ only in their choice of evaluation locations. Building on this formulation, we propose A2D2E, an Estimator based on Accumulated Aggregated D-Optimal Designs, which replaces evaluations with a D-optimal hypercube design to minimize the variance of main effect estimation. A2D2E is model-agnostic, requires no differentiability of the predictor, and admits a closed-form estimator with complexity comparable to existing approaches. We establish that A2D2E is consistent to the same population target as ALE, and extend this result to the realistic setting where only a surrogate model is available. Through extensive simulations across multiple predictive models and dependence settings, we demonstrate that A2D2E outperforms ALE-based methods, with the largest gains under high feature correlation.