arxiv_cs_lg 2026年4月24日

バイオイメージリングにおける基礎モデル：ハプを実現に変える

Foundation Models in Biomedical Imaging: Turning Hype into Reality

Translated: 2026/4/24 20:13:31

foundation-modelsbiomedical-imagingmedical-aihealthcare-technologyartificial-intelligence

Japanese Translation

arXiv:2512.15808v2 Announce Type: replace-cross 要約：基礎モデル（Foundation Models; FMs）は、タスク特異的モデルから多様なタスクに対応する統合的なバックボーンモデルへのバイオイメージリングの主要な転換を主導しています。これにより、イメージング、病理学、臨床記録、ゲノムデータなどの統合システムへのアクセスが開けました。しかし、このビジョンは、現代医学がより細分化された専門分野へと向かう軌道とは対照的です。この張力に加えて、データの希少性、ドメインの不均一性、解釈性の限界は、ベンチマークにおける成功と実際の臨床的価値との間にギャップを生み出しています。われわれは、FMs の即時的役割は臨床的専門性を置き換えるのではなく、それを補助することであると論じます。ハプと現実を分けるために、われわれは、データ、技術的準備、臨床的価値、ワークフロー統合、および責任ある AI を評価するための多面的なフレームワークを確立した REAL-FM（Real-world Evaluation and Assessment of Foundation Models）を導入します。REAL-FM を用いると、FMs はパターン認識においては優れていますか、因果論理、ドメインの頑健性、安全性においては不足していることがわかりました。臨床への翻訳は、モデルトレーニングのための代表性の低いデータ不足、過剰単純化されたベンチマーク設定を超えた未検証な一般化の欠如、そして先行予測結果に基づく検証の欠如により妨げられています。われわれはまた、FM の推理パラダイム、即ち順次論理、空間的理解、記号的ドメイン知識についても検討します。われわれは、未来の道は単一の大規模な医療の知恵の源であることではなく、透明性があり、安全性に配慮し、臨床に根ざした協調的な分科専門家の AI システムにあると描きます。

Original Content

arXiv:2512.15808v2 Announce Type: replace-cross Abstract: Foundation models (FMs) are driving a prominent shift in biomedical imaging from task-specific models to unified backbone models for diverse tasks. This opens an avenue to integrate imaging, pathology, clinical records, and genomics data into a composite system. However, this vision contrasts sharply with modern medicine's trajectory toward more granular sub-specialization. This tension, coupled with data scarcity, domain heterogeneity, and limited interpretability, creates a gap between benchmark success and real-world clinical value. We argue that the immediate role of FMs lies in augmenting, not replacing, clinical expertise. To separate hype from reality, we introduce REAL-FM (Real-world Evaluation and Assessment of Foundation Models), a multi-dimensional framework for assessing data, technical readiness, clinical value, workflow integration, and responsible AI. Using REAL-FM, we find that while FMs excel in pattern recognition, they fall short in causal reasoning, domain robustness, and safety. Clinical translation is hindered by scarce representative data for model training, unverified generalization beyond oversimplified benchmark settings, and a lack of prospective outcome-based validation. We further examine FM reasoning paradigms, including sequential logic, spatial understanding, and symbolic domain knowledge. We envision that the path forward lies not in a monolithic medical oracle, but in coordinated subspecialist AI systems that are transparent, safe, and clinically grounded.