arxiv_cs_lg 2026年2月10日

フェデレーテッドラーニングにおける適切なタイミングでの適正な報酬：R3T

Right Reward Right Time for Federated Learning

Translated: 2026/3/15 9:04:17

federated-learningcontract-theorymachine-learningincentive-mechanismstime-aware

Japanese Translation

arXiv:2503.07869v2 Announce Type: replace Abstract: フェデレーテッドラーニング（FL）における臨界学習期間（CLP）とは、低品質な貢献（例：スパースなトレーニングデータの可用性）がモデルオーナー（すなわちクラウドサーバー）のグローバルモデルのパフォーマンスを恒久的に損傷させる初期段階を指します。しかし、既存のインセンティブ機構は時間的同質性を示し、すべてのトレーニングラウンドを同様に重要とみなすために、CLP 段階に高品質な貢献を優先し、引き付けることに失敗しています。この非効率性は、プライバシー規制による情報非対称性でさらに悪化しており、クラウドはクライアントのトレーニング能力について知識を持っていないためです。これにより、ネグレクト選択が発生します。したがって、この論文では、クライアントの関与、特に CLP 段階を促すクラウドサーバーの効率最大化を図るために、時知覚契約理論に基づくインセンティブ枠組み Right Reward Right Time（R3T）を提案します。私たちは、実現したモデルパフォーマンスとクライアントへの報酬のトレードオフを捉え、クライアントの多様性（時間、システム能力、労力、参加時間）を明示的に考慮したクラウドユーティリティ関数を構築します。その後、個人の合理性、誘発整合性、予算実現可能性の制約を満たす最適な契約設計を導出した CLP 知覚インセンティブ機構を考案します。これにより、合理的なクライアントが早期に参加し、労力を貢献する動機づけを行います。適切なタイミングに適切な報酬を提供することで、私たちのアプローチは CLP 段階に最高品質な貢献を引き付けることができます。シミュレーションと概念実証の研究では、R3T は情報非対称性を緩和し、クラウドのユーティリティを増加させ、従来のインセンティブ機構と比較して卓越した経済的効率性を発揮したことが示されました。また、コンセプト実証の結果は、総クライアント数を最大 47.6% 削減し、収束時間を最大 300% 改善し、かつ競合的なテスト精度を達成することを示しています。

Original Content

arXiv:2503.07869v2 Announce Type: replace Abstract: Critical learning periods (CLPs) in federated learning (FL) refer to early stages during which low-quality contributions (e.g., sparse training data availability) can permanently impair the performance of the global model owned by the model owner (i.e., a cloud server). However, existing incentive mechanisms exhibit temporal homogeneity, treating all training rounds as equally important, thereby failing to prioritize and attract high-quality contributions during CLPs. This inefficiency is compounded by information asymmetry due to privacy regulations, where the cloud lacks knowledge of client training capabilities, leading to adverse selection. Thus, in this article, we propose a time-aware contract-theoretic incentive framework, named Right Reward Right Time (R3T), to encourage client involvement, especially during CLPs, to maximize the utility of the cloud server. We formulate a cloud utility function that captures the trade-off between the achieved model performance and rewards allocated for clients' contributions, explicitly accounting for client heterogeneity in time and system capabilities, effort, and joining time. Then, we devise a CLP-aware incentive mechanism deriving an optimal contract design that satisfies individual rationality, incentive compatibility, and budget feasibility constraints, motivating rational clients to participate early and contribute efforts. By providing the right reward at the right time, our approach can attract the highest-quality contributions during CLPs. Simulation and proof-of-concept studies show that R3T mitigates information asymmetry, increases cloud utility, and yields superior economic efficiency compared to conventional incentive mechanisms. Our proof-of-concept results demonstrate up to a 47.6% reduction in the total number of clients and up to a 300% improvement in convergence time while achieving competitive test accuracy.