arxiv_cs_cv 2026年2月10日

現実世界における画像超分解解像度のための双方向報酬導向拡散モデル Bird-SR

Bidirectional Reward-Guided Diffusion for Real-World Image Super-Resolution

Translated: 2026/3/15 17:04:43

diffusion-modelsimage-super-resolutionreward-modelingcomputer-vision

Japanese Translation

arXiv:2602.07069v1 Announce Type: new 摘要：拡散ベースの超分解解像度モデルは富な細部を合成できますが、合成された対応データで訓練されたモデルは、分布のシフトのために実際の低解像度 (LR) 画像で失敗する傾向があります。ここでは、合成 LR-HR 対と実際の LR 画像を同時に活用し、双方向の報酬導向拡散フレームワークである Bird-SR を提案します。超分解解像度を、報酬フィードバック学習 (ReFL) を通じた軌道レベルでの好む最適化として形式化します。構造的忠実性は ReFL で容易に影響を受けやすい要素であるため、モデルは拡散の初期段階で合成対に対して直接最適化されます。これにより、構造レベルにおける分布の隙が小さくなる実際の入力に対して構造の保持を容易にします。認識的な向上のために、品質導向の報酬は後方のサンプリングステップに適用され、合成および実際の LR 画像の両方に対して適用されます。報酬ハッキングを軽減するために、合成結果に対する報酬は、クリーンな対応物を基準とすることで相対的な利得空間に形式化され、かつ上限定り、実際の世界の最適化はセマンティック一致の制約を通じて正規化されます。さらに、構造的学習と認識的学習のバランスを取るために、初期段階では構造の保持を強調し、後方の拡散ステップにわたって徐々に認識的な最適化への焦点をシフトする動的忠実性 - 認識度重み付け戦略を採用しました。実際の SR ベンチマークにおける大規模な実験は、Bird-SR が認識的な質において state-of-the-art メソッドを一貫して凌駕しつつ構造的一貫性を保持し、その現実世界での超分解解像度に対する有効性を検証しました。

Original Content

arXiv:2602.07069v1 Announce Type: new Abstract: Diffusion-based super-resolution can synthesize rich details, but models trained on synthetic paired data often fail on real-world LR images due to distribution shifts. We propose Bird-SR, a bidirectional reward-guided diffusion framework that formulates super-resolution as trajectory-level preference optimization via reward feedback learning (ReFL), jointly leveraging synthetic LR-HR pairs and real-world LR images. For structural fidelity easily affected in ReFL, the model is directly optimized on synthetic pairs at early diffusion steps, which also facilitates structure preservation for real-world inputs under smaller distribution gap in structure levels. For perceptual enhancement, quality-guided rewards are applied at later sampling steps to both synthetic and real LR images. To mitigate reward hacking, the rewards for synthetic results are formulated in a relative advantage space bounded by their clean counterparts, while real-world optimization is regularized via a semantic alignment constraint. Furthermore, to balance structural and perceptual learning, we adopt a dynamic fidelity-perception weighting strategy that emphasizes structure preservation at early stages and progressively shifts focus toward perceptual optimization at later diffusion steps. Extensive experiments on real-world SR benchmarks demonstrate that Bird-SR consistently outperforms state-of-the-art methods in perceptual quality while preserving structural consistency, validating its effectiveness for real-world super-resolution.