arxiv_cs_lg 2026年2月10日

接触アタッチメントに基づく政策：接触条件付付は強固なロボット有用性モデルを創出する

Contact-Anchored Policies: Contact Conditioning Creates Strong Robot Utility Models

Translated: 2026/3/15 8:10:30

robot-learningreinforcement-learningsimulationvlarobotics

Japanese Translation

arXiv:2602.09017v1 発表タイプ: cross 要約: ロボットの学習における主要なパラダイムは、ランタイムで言語プロンプトを使用して環境、実装、タスクを超えて一般化することを目指しています。このアプローチに根本的な緊張が存在します: 言語は、堅牢な操作に必要な具体的な物理的理解を導くにはあまりにも抽象的であることが多いからです。この研究では、Contact-Anchored Policies（CAP）を導入し、言語条件付付を空間上の物理的接触点に置き換えます。同時に、CAP を単一の汎用政策ではなく、モジュラー有用性モデルのライブラリとして構造化します。この因子化により、われわれは現実からシミュレーションへのイテレーションサイクルを実装できます：われわれは EgoGym という軽量シミュレーションベンチマークを構築し、実世界への展開前に我々のモデルとデータを迅速に失敗モードを特定し改善しました。接触条件付付とシミュレーションによるイテレーションによって、CAP は 3 つの基本的な操作技能において外箱から新しい環境と実装に対して一般化し、23 時間のデモンストレーションデータのみを使用しました。また、ゼロショット評価において、大規模な最先端の VLAs（視覚・言語統合アーキテクチャ）を 56% 上回りました。すべてのモデルチェックポイント、コードベース、ハードウェア、シミュレーション、データセットはオープンソース化されます。プロジェクトページ: https://cap-policy.github.io/

Original Content

arXiv:2602.09017v1 Announce Type: cross Abstract: The prevalent paradigm in robot learning attempts to generalize across environments, embodiments, and tasks with language prompts at runtime. A fundamental tension limits this approach: language is often too abstract to guide the concrete physical understanding required for robust manipulation. In this work, we introduce Contact-Anchored Policies (CAP), which replace language conditioning with points of physical contact in space. Simultaneously, we structure CAP as a library of modular utility models rather than a monolithic generalist policy. This factorization allows us to implement a real-to-sim iteration cycle: we build EgoGym, a lightweight simulation benchmark, to rapidly identify failure modes and refine our models and datasets prior to real-world deployment. We show that by conditioning on contact and iterating via simulation, CAP generalizes to novel environments and embodiments out of the box on three fundamental manipulation skills while using only 23 hours of demonstration data, and outperforms large, state-of-the-art VLAs in zero-shot evaluations by 56%. All model checkpoints, codebase, hardware, simulation, and datasets will be open-sourced. Project page: https://cap-policy.github.io/