arxiv_cs_lg 2026年2月10日

MedVerse: DAG 構造による並列実行を介した効率的かつ信頼性の高い医学推論

MedVerse: Efficient and Reliable Medical Reasoning via DAG-Structured Parallel Execution

Translated: 2026/3/15 14:08:17

medverselarge-language-modelsdirected-acyclic-graphpetri-netmedical-reasoning

Japanese Translation

論文 arXiv:2602.07529v1 発表タイプ：新規要旨：大規模言語モデル（LLM）は、幅広い医学推論タスクにおいて強力な性能と高速な進展を示しています。しかし、その順序論的 autoregressive（自己再帰的）デコードは、差異診断など本来に並列で行える臨床推論を単一の直線推論パスに強制し、複雑な医学問題における効率性と信頼性の両方を制限しています。これを解決するために、Petri net 理論に基づき、医学推論を並列化可能な有向非循環グラフ（DAG）プロセスとして再定式化する、複雑な医学推論のためのフレームワークである MedVerse を提案します。このフレームワークは、データ、モデルアーキテクチャ、およびシステム実行にわたるフルスタック設計を採用しています。データ作成においては、MedVerse Curator という自動化パイプラインを導入し、知見に基づく医学推論パスを合成し、それらを Petri net 構造型の表現へと変換します。アーキテクチャレベルでは、並列推論をサポートしつつ論理的整合性を保持する、適応的インデックスを持つトポロジー認識注意機構を提案しました。システム側では、追加のオーバーヘッドなしで並列実行をサポートするカスタム推論エンジンを開発しました。実験的評価では、MedVerse が強力な汎用 LLM を最大 8.9% 改善しました。専用の医学 LLM と比較すると、MedVerse は同等の性能を実現しつつ、並列デコード機能により推論レイテンシを 1.3 倍削減し、生成スループットを 1.7 倍向上させました。

Original Content

arXiv:2602.07529v1 Announce Type: new Abstract: Large language models (LLMs) have demonstrated strong performance and rapid progress in a wide range of medical reasoning tasks. However, their sequential autoregressive decoding forces inherently parallel clinical reasoning, such as differential diagnosis, into a single linear reasoning path, limiting both efficiency and reliability for complex medical problems. To address this, we propose MedVerse, a reasoning framework for complex medical inference that reformulates medical reasoning as a parallelizable directed acyclic graph (DAG) process based on Petri net theory. The framework adopts a full-stack design across data, model architecture, and system execution. For data creation, we introduce the MedVerse Curator, an automated pipeline that synthesizes knowledge-grounded medical reasoning paths and transforms them into Petri net-structured representations. At the architectural level, we propose a topology-aware attention mechanism with adaptive position indices that supports parallel reasoning while preserving logical consistency. Systematically, we develop a customized inference engine that supports parallel execution without additional overhead. Empirical evaluations show that MedVerse improves strong general-purpose LLMs by up to 8.9%. Compared to specialized medical LLMs, MedVerse achieves comparable performance while delivering a 1.3x reduction in inference latency and a 1.7x increase in generation throughput, enabled by its parallel decoding capability.