arxiv_cs_ai 2026年4月24日

言語モデルによる数学的論理の構成分析：シューエンフェルッドの論理構造理論に基づく

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

Translated: 2026/4/24 20:33:45

language-modelsmathematical-reasoningepisode-theorythinkarmarxiv-2512.19995

Japanese Translation

arXiv:2512.19995v2 Announce Type: replace-cross 摘文：大規模言語モデルは論理的思考の過程を明らかにしてきていますが、その下の認知構造や思考手順は、表面の統計データを超えた段階で同定し解析するのが困難です。本研究では、シューエンフェルッドのエピソード理論を適用し、論理プロセスを分析、探索、実装、検証などの機能論的ステップとして明示的に抽象化するインダクティブな中規模視点を導入しました。この枠組みは「ThinkARM（ThinkARM：モデルにおける論理の構成分析）」と命名されます。この枠組みを多様なモデルの数学問題解決に適用すると、トークンレベルの視点からは明白ではない、再現性のある思考動向と、論理モデルと非論理モデル間の構造的差異が浮き彫りになります。また、本研究では、探索が正解との関連を持つ重要な分岐点であること、効率的な手法が評価フィードバックステップを統一的に短縮するのではなく、選択的に抑制する場合があるという 2 つの診断事例も提示しました。私たちの研究結果は、エピソードレベルの表現が思考手順を明示化し、現代の言語モデルにおいて論理がどのように構成され、安定化し、変化するかを系統的に分析することを可能にすることを示しています。

Original Content

arXiv:2512.19995v2 Announce Type: replace-cross Abstract: Large language models increasingly expose reasoning traces, yet their underlying cognitive structure and steps remain difficult to identify and analyze beyond surface-level statistics. We adopt Schoenfeld's Episode Theory as an inductive, intermediate-scale lens and introduce ThinkARM (Anatomy of Reasoning in Models), a scalable framework that explicitly abstracts reasoning traces into functional reasoning steps such as Analysis, Explore, Implement, Verify, etc. When applied to mathematical problem solving by diverse models, this abstraction reveals reproducible thinking dynamics and structural differences between reasoning and non-reasoning models, which are not apparent from token-level views. We further present two diagnostic case studies showing that exploration functions as a critical branching step associated with correctness, and that efficiency-oriented methods selectively suppress evaluative feedback steps rather than uniformly shortening responses. Together, our results demonstrate that episode-level representations make reasoning steps explicit, enabling systematic analysis of how reasoning is structured, stabilized, and altered in modern language models.