arxiv_cs_cv 2026年4月20日

頭頸部がんの予後予測における XAI メソッドのランキング付け

Ranking XAI Methods for Head and Neck Cancer Outcome Prediction

Translated: 2026/4/20 10:44:31

xaihealthcaremedical-imaginginterpretabilityartificial-intelligence

Japanese Translation

頭頸部がん（HNC）患者における予後予測は、個別化治療戦略の選定をサポートする役割を果たします。PET/CT データを応用した最先端人工知能（AI）技術を用いて、HNC 予後予測の性能向上が広く探求されています。しかし、AI の解釈可能性は依然として臨床導入における重要な障壁です。以前の研究が経験的に説明可能 AI（XAI）手法を選定したのに対し、本研究は初めて XAI メソッドの 13 種を 24 つの指標にわたって包括的に評価しランク付けしました。これらの指標には、忠実性、頑健性、複雑さ、妥当性が含まれています。複数センター HECKTOR チョレンジーンのデータセットに基づく実験結果は、異なる XAI メソッド間に評価項目ごとに大きな変動が見られることを示しており、統合勾配（IG）および DeepLIFT（DL）は忠実性、複雑さ、妥当性のいずれでも一貫して高いランクを得ています。本研究は、包括的な XAI メソッド評価の重要性を指摘し、他の画像解析タスクにも拡張適用可能な可能性を示唆しています。

Original Content

arXiv:2604.16034v1 Announce Type: new Abstract: For head and neck cancer (HNC) patients, prognostic outcome prediction can support personalized treatment strategy selection. Improving prediction performance of HNC outcomes has been extensively explored by using advanced artificial intelligence (AI) techniques on PET/CT data. However, the interpretability of AI remains a critical obstacle for its clinical adoption. Unlike previous HNC studies that empirically selected explainable AI (XAI) techniques, we are the first to comprehensively evaluate and rank 13 XAI methods across 24 metrics, covering faithfulness, robustness, complexity and plausibility. Experimental results on the multi-center HECKTOR challenge dataset show large variations across evaluation aspects among different XAI methods, with Integrated Gradients (IG) and DeepLIFT (DL) consistently obtained high rankings for faithfulness, complexity and plausibility. This work highlights the importance of comprehensive XAI method evaluation and can be extended to other medical imaging tasks.