arxiv_cs_ai 2026年2月10日

再考する科学モデル：物理一致性とシミュレーション実行可能プログラム生成に向けて

Rethinking Scientific Modeling: Toward Physically Consistent and Simulation-Executable Programmatic Generation

Translated: 2026/3/7 11:34:25

modellingscientific-computingconstraint-mlcivil-engineering

Japanese Translation

構造モデリングは計算機工学科学の基本的な要素であり、それだけで一部の物理的不一致であったり指定違反があったら、後続のシミュレーションが一からしかかってしまうでしょう。大きな言語モデル（LLM)を使った自動化されたモデリングコードを生成する可能性が示されました。しかし厳格な工科要件に対する実行不能あるいは物理的不一致の出力はまだ一般的です。そこで物理的統一性向けモデルビルディングのためのフレームワークを提案しました。これは専門知識の構築、制約優先モデル統合、検定適用評価を集成しているのが特徴です。CivilInstructという専門用のデータセットを導入し、規格や制約論理を形式的に表現した構造エンジニアリングの知識を提供します。そして二段階の細調整戦略を使用して、制約の満たされ方とアプリケーションプログラミングインターフェースの適合性を強化し、虚言の生成や非合致出力を大幅に減らしました。MBEvalは検査適用評価ベンチマークであり、自作用的な実行可能性と構造動的統一性について閉路確認を行うことで問題点を見つけ出します。実験結果をみると最も厳しい検証指標においてベースラインに対する安定した向上を行っています。私たちのコードは https://github.com/Jovanqing/AutoBM で利用可能です。

Original Content

arXiv:2602.07083v1 Announce Type: cross Abstract: Structural modeling is a fundamental component of computational engineering science, in which even minor physical inconsistencies or specification violations may invalidate downstream simulations. The potential of large language models (LLMs) for automatic generation of modeling code has been demonstrated. However, non-executable or physically inconsistent outputs remain prevalent under stringent engineering constraints. A framework for physics-consistent automatic building modeling is therefore proposed, integrating domain knowledge construction, constraint-oriented model alignment, and verification-driven evaluation. CivilInstruct is introduced as a domain-specific dataset that formalizes structural engineering knowledge and constraint reasoning to enable simulation-ready model generation. A two-stage fine-tuning strategy is further employed to enforce constraint satisfaction and application programming interface compliance, substantially reducing hallucinated and non-conforming outputs. MBEval is presented as a verification-driven benchmark that evaluates executability and structural dynamics consistency through closed-loop validation. Experimental results show consistent improvements over baselines across rigorous verification metrics. Our code is available at https://github.com/Jovanqing/AutoBM.