arxiv_cs_ai April 24, 2026

Model Capability Assessment and Safeguards for Biological Weaponization

arXiv:2604.19811v2 Announce Type: replace-cross

Translated: 2026/4/24 20:35:52

model-capability-assessment, biological-safety, ai-governance, generative-ai, prompt-benchmarks

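Japanese Translation (rendered in English)

AI leaders and safety reports warn that advances in model reasoning may enable biological misuse, including by low-expertise users, while major labs describe their safeguards as expanding but not yet settled. This study benchmarks ChatGPT 5.2 Auto, Gemini 3 Pro Thinking, Claude Opus 4.5, and Meta's Muse Spark Thinking on 73 novice-framed, open-ended benign STEM prompts to measure operational intelligence. On benign quantitative tasks, Gemini and Meta scored very high, ChatGPT was partially useful but text-thinned, and Claude was the sparsest, with some apparent false-positive refusals. A second test set detected subtle harmful intent, and edge-case prompts revealed Gemini's seeming lack of contextual awareness. These results warranted a weaponization analysis focused on Gemini, because capability appeared to be outpacing moderation calibration. Gemini was tested across four access environments; reported cases include poison-ivy-to-crowded-transit escalation, poison production and extraction via international-anonymous logged-out AI Mode, and other concerning examples. Biological misuse may become more prevalent as a geopolitical tool, increasing the urgency of U.S. policy responses, especially if model outputs come to be treated as regulated technical data. Guidance is provided for 25 high-risk agents to help distinguish legitimate use cases from higher-risk ones.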

Original Content

Abstract: AI leaders and safety reports increasingly warn that advances in model reasoning may enable biological misuse, including by low-expertise users, while major labs describe safeguards as expanding but still evolving rather than settled. This study benchmarks ChatGPT 5.2 Auto, Gemini 3 Pro Thinking, Claude Opus 4.5 and Meta's Muse Spark Thinking on 73 novice-framed, open-ended benign STEM prompts to measure operational intelligence. On benign quantitative tasks, both Gemini and Meta scored very high; ChatGPT was partially useful but text-thinned, and Claude was sparsest with some apparent false-positive refusals. A second test set detected subtle harmful intent: edge case prompts revealed Gemini's seeming lack of contextual awareness. These results warranted a focused weaponization analysis on Gemini as capability appeared to be outpacing moderation calibration. Gemini was tested across four access environments and reported cases include poison-ivy-to-crowded-transit escalation, poison production and extraction via international-anonymous logged-out AI Mode, and other concerning examples. Biological misuse may become more prevalent as a geopolitical tool, increasing the urgency of U.S. policy responses, especially if model outputs come to be treated as regulated technical data. Guidance is provided for 25 high-risk agents to help distinguish legitimate use cases from higher-risk ones.