arxiv_cs_ai 2026年2月10日

AIxCC：DARPAのAI暗号技術チャレンジ：コンペティション設計、アーキテクチャ、および学んだこと

SoK: DARPA's AI Cyber Challenge (AIxCC): Competition Design, Architectures, and Lessons Learned

Translated: 2026/3/7 12:39:21

ai-cyberdarpa-ai-challengeautonomous-systemscrs-aibig-language-models

Japanese Translation

DARPA の AI 暗号技術チャレンジ（AIxCC）は、特に大規模言語モデルを活用してリアルワールドのオープンソースソフトウェアにおける脆弱性を特定と修復し得る完全自動的な暗号論理システム（CRSs）を開発するための大規模なコンペティションにあたります。この記事は、AIxCCを最初となる全体的な分析を行っています。設計文書、サブルーチンコード、実行トレース、また組織者と競争チームとの議論を含めることで、コンペティションの構造及び主なものに関する決定について探求めています。また選出されたCRSの構造的なアプローチを性格付けます。そして、コンペティション結果はファイナルスコアボードのみに限定されることはありません。この解析を通じてCRSのパフォーマンスが大きく影響を与える要因を明らかにしますのはもちろんのこと、チーム達が真の技術的な発展を遂げたことを示す点もありますまたそれ以外にも、今後の研究で課題として待たい点を見つけることも可能です。結論的にはこれについては未来におけるコンペティションを開催する際の教訓と、実践的にCRSを導入することへの関心事に対する更なる洞察についてまとめています。

Original Content

arXiv:2602.07666v1 Announce Type: cross Abstract: DARPA's AI Cyber Challenge (AIxCC, 2023--2025) is the largest competition to date for building fully autonomous cyber reasoning systems (CRSs) that leverage recent advances in AI -- particularly large language models (LLMs) -- to discover and remediate vulnerabilities in real-world open-source software. This paper presents the first systematic analysis of AIxCC. Drawing on design documents, source code, execution traces, and discussions with organizers and competing teams, we examine the competition's structure and key design decisions, characterize the architectural approaches of finalist CRSs, and analyze competition results beyond the final scoreboard. Our analysis reveals the factors that truly drove CRS performance, identifies genuine technical advances achieved by teams, and exposes limitations that remain open for future research. We conclude with lessons for organizing future competitions and broader insights toward deploying autonomous CRSs in practice.