arxiv_cs_cv 2026年4月24日

Foundation モデルを用いた注意ベースの複数インスタンス学習：肺腺がんの全スライドレベルにおける主要成長パターン予測

Attention-based multiple instance learning for predominant growth pattern prediction in lung adenocarcinoma wsi using foundation models

Open original article

Translated: 2026/4/24 19:44:51

lung-adenocarcinomafoundation-modelsmultiple-instance-learningpathology-aiwhole-slide-image

Japanese Translation

arXiv:2604.21530v1 Announce Type: new 摘要：肺腺がん（LUAD）の grading は、成長パターンの正確な特定に依存しており、これは予後を示す指標であると同時に治療決定に影響を与える。一般的な深層学習アプローチは、パッチレベルの分類またはセグメンテーションに頼っており、大規模なアノテーションを必要とする。本論文は、アノテーション負荷を削減するために全スライドレベルにおける LUAD の主要成長パターンを予測するための、注意ベースの複数インスタンス学習（ABMIL）フレームワークを提案する。本手法は、事前訓練された病理基礎モデルをパッチエンコーダーとして統合しており、これらはアノテーション付きパッチ上で凍結または微調整され、空間注意力メカニズムを通じて集約された弁別能的特徴を抽出する。実験の結果、微調整されたエンコーダーはパフォーマンスを改善し、ABMIL 下で Prov-GigaPath の Agreement (kappa = 0.699) が最高であることが示された。単純なパッチ集約ベースラインと比較して、ABMIL はスライドレベルの監督情報と空間注意力を活用することで、より頑健な予測を生み出す。今後の工作是、このフレームワークを成長パターンの完全な分布の推定に拡張し、外部コホートでのパフォーマンスを検証する予定である。

Original Content

arXiv:2604.21530v1 Announce Type: new Abstract: Lung adenocarcinoma (LUAD) grading depends on accurately identifying growth patterns, which are indicators of prognosis and can influence treatment decisions. Common deep learning approaches to determine the predominant pattern rely on patch-level classification or segmentation, requiring extensive annotations. This study proposes an attention-based multiple instance learning (ABMIL) framework to predict the predominant LUAD growth pattern at the whole slide level to reduce annotation burden. Our approach integrates pretrained pathology foundation models as patch encoders, used either frozen or fine-tuned on annotated patches, to extract discriminative features that are aggregated through attention mechanisms. Experiments show that fine-tuned encoders improve performance, with Prov-GigaPath achieving the highest agreement (\k{appa} = 0.699) under ABMIL. Compared to simple patch-aggregation baselines, ABMIL yields more robust predictions by leveraging slide-level supervision and spatial attention. Future work will extend this framework to estimate the full distribution of growth patterns and validate performance on external cohorts.