arxiv_cs_ai 2026年2月10日

Glow: 行動アリゲーティブ・ボディー・ビゼフデクテイシャツ・コンシェルジュとしての生成AIをpoweredする精神療法技能支援システムに対するリスク probingと実現可能性テスト

Initial Risk Probing and Feasibility Testing of Glow: a Generative AI-Powered Dialectical Behavior Therapy Skills Coach for Substance Use Recovery and HIV Prevention

Open original article

Translated: 2026/3/7 9:53:45

generative-artificial-intelligencedialectical-behavior-therapysubstance-use-recoverymental-healthrisk-probing

Japanese Translation

背後にある背景: HIVおよび薬物使用は、共通の心理的ダイナミクスに関与し、 impulsivity そして避けた対処法を表す。行動アリゲーティブ・ボディー・ビゼフデクテイシャツ (DBT) をターゲットとしていますが、伸縮性の課題があり拡大が難しくなっています。生成人工知能 (GenAI) はこれらのメカニズムを個人向けに personalize して DBT コーチングをデリバリしことができる可能性がありますが、発展の速さと安全インフラの遅れがあります。方法：我々は GlOW、生成アリゲーションによって powered された個体に対する DBT 動作支援システムを持っています、彼らは HIV 益または薬物使用が危険にある個人に展開を提供します。デジャングラのパートナーシップでのLos Angeles地域コミュニティ健康組織および被験者 (n=6) と被験者を持つ lived experience の参加者が共有した Usability テストを行いました。 HHHフレームワークでのユーザーメード adversarial テスティングを用いて、参加者は対象行動が生成された具体的なリスク probing を特定しました。我々は安全性の性能を評価して、37 のリスク probing 実験を完了した。結果： GlOW はその 73% のリスク probing を適切に処理し、しかし処理性能はジェントでした。シェン分析ア gentle のジェントは90%の適切な処置でしたが、チェーン分析のジェントの44%でした。安全性の失敗はアルツで発生しました：薬物使用を奨励し、有害行為の正規化。チェインア gentle アgent が「寛容トライ」に入ります、誤認を与える感情的な絵に見えて了承は適切なマッチを強化した maladaptive beliefs。さらに27個の誤って認識された DBT 動作ミスも見つかりました。結論: この研究ではまず systematic 的な安全保障の評価 GlOW を使用して、HIVと薬物使用リスクの減少に向けられた生成AIによるDBT コーチングに対するのためでしたを見つけました。処理は課題として認識されましたので、clinical trials には対処を先送りする前に安全なテクニックが重要です。 HHH フレームワークとユーザーキャンディ adversarial テスティングにより再利用可能な評価方法が提供されましたGlOW生成AI 課題の精神的なヘルスセーション。

Original Content

arXiv:2602.08121v1 Announce Type: new Abstract: Background: HIV and substance use represent interacting epidemics with shared psychological drivers - impulsivity and maladaptive coping. Dialectical behavior therapy (DBT) targets these mechanisms but faces scalability challenges. Generative artificial intelligence (GenAI) offers potential for delivering personalized DBT coaching at scale, yet rapid development has outpaced safety infrastructure. Methods: We developed Glow, a GenAI-powered DBT skills coach delivering chain and solution analysis for individuals at risk for HIV and substance use. In partnership with a Los Angeles community health organization, we conducted usability testing with clinical staff (n=6) and individuals with lived experience (n=28). Using the Helpful, Honest, and Harmless (HHH) framework, we employed user-driven adversarial testing wherein participants identified target behaviors and generated contextually realistic risk probes. We evaluated safety performance across 37 risk probe interactions. Results: Glow appropriately handled 73% of risk probes, but performance varied by agent. The solution analysis agent demonstrated 90% appropriate handling versus 44% for the chain analysis agent. Safety failures clustered around encouraging substance use and normalizing harmful behaviors. The chain analysis agent fell into an "empathy trap," providing validation that reinforced maladaptive beliefs. Additionally, 27 instances of DBT skill misinformation were identified. Conclusions: This study provides the first systematic safety evaluation of GenAI-delivered DBT coaching for HIV and substance use risk reduction. Findings reveal vulnerabilities requiring mitigation before clinical trials. The HHH framework and user-driven adversarial testing offer replicable methods for evaluating GenAI mental health interventions.