arxiv_cs_lg 2026年4月24日

RADS: 強化学習によるサンプル選択が、低リソースかつ不平衡な臨床設定における転移学習を改善する

RADS: Reinforcement Learning-Based Sample Selection Improves Transfer Learning in Low-resource and Imbalanced Clinical Settings

Open original article

Translated: 2026/4/24 20:04:27

reinforcement-learningtransfer-learningactive-learningsample-selectionclinical-datasets

Japanese Translation

arXiv:2604.20256v1 Announce Type: cross 要約：転移学習における一般的な戦略は数ショットファインチューニングですが、その成功はトレーニング例として選択されるサンプルの品質に大きく依存します。不確実性サンプリングや多様性サンプリングなどのアクティブ学習手法は有用なサンプルを抽出できますが、極めて低リソースかつクラス不平衡な条件下では、真に情報豊富なサンプルよりも外れ値を優先する傾向があり、性能が低下します。本研究では、強化学習（RL）を用いた堅牢なサンプル選択戦略である RADS（Reinforcement Adaptive Domain Sampling）を導入し、最も情報豊富なサンプルを特定することを提案します。複数の実際の臨床データセットでの実験評価では、従来手法と比較して、RADS のサンプル選択戦略がモデルの転移性向上を保ち、極端なクラス不平衡条件下でも堅牢な性能を発揮することを確認しました。

Original Content

arXiv:2604.20256v1 Announce Type: cross Abstract: A common strategy in transfer learning is few shot fine-tuning, but its success is highly dependent on the quality of samples selected as training examples. Active learning methods such as uncertainty sampling and diversity sampling can select useful samples. However, under extremely low-resource and class-imbalanced conditions, they often favor outliers rather than truly informative samples, resulting in degraded performance. In this paper, we introduce RADS (Reinforcement Adaptive Domain Sampling), a robust sample selection strategy using reinforcement learning (RL) to identify the most informative samples. Experimental evaluations on several real world clinical datasets show our sample selection strategy enhances model transferability while maintaining robust performance under extreme class imbalance compared to traditional methods.