arxiv_cs_ai 2026年2月10日

IVコサイエンティスト:多重代理人のLLMフрейameworkのカusalinstrumental変数発見

IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery

Translated: 2026/3/7 9:39:38

causal-inferenceinstrumental-variableslanguage-modelsmulti-agent-system

Japanese Translation

結局のところ、内生変量とアウトカム間の混雑に関わらず、インストラクタービームは、内生変量の因果効果を離脱するための手法として使用されます。有効なインストラクターベームの識別には、多学科的で革新的な知見とコンテキストに対する理解が必要であり、その目標は実質的に難しいタスクです。この論文では、LLMがこの取組を支援するかどうか調査します。まず、我々はLLMが既存の文献からの正当なインストラクターベームを回復可能であるか確かめる試験を行い、その能力を標準的な推理へ復元可能な能力を評価します。二枚目では、我々がある種の実証的なまたは理論的に否定されたインストラクターベームを見つけないかどうかを評価します。我々の結果により、IVコサイエンティストというマルチアジェンターのシステムを開発し、特定の治療反応対アウトカムペアでの提案・批判・改善IVスキャナー機能を導入します。また、インスタンツベームが観測された大規模データベースの中で、適切な不確実性なしに一致するための統計的なテストを開発します。私たちはLLMによる有効なインストラクターベームの発見可能性を理解させる研究成果を示しています。

Original Content

arXiv:2602.07943v1 Announce Type: new Abstract: In the presence of confounding between an endogenous variable and the outcome, instrumental variables (IVs) are used to isolate the causal effect of the endogenous variable. Identifying valid instruments requires interdisciplinary knowledge, creativity, and contextual understanding, making it a non-trivial task. In this paper, we investigate whether large language models (LLMs) can aid in this task. We perform a two-stage evaluation framework. First, we test whether LLMs can recover well-established instruments from the literature, assessing their ability to replicate standard reasoning. Second, we evaluate whether LLMs can identify and avoid instruments that have been empirically or theoretically discredited. Building on these results, we introduce IV Co-Scientist, a multi-agent system that proposes, critiques, and refines IVs for a given treatment-outcome pair. We also introduce a statistical test to contextualize consistency in the absence of ground truth. Our results show the potential of LLMs to discover valid instrumental variables from a large observational database.