arxiv_cs_ai 2026年2月10日

SHERLOCK：LLM増強済みECリスク管理におけるダイナミック知識適合化に向けた

SHERLOCK:Towards Dynamic Knowledge Adaptation in LLM-enhanced E-commerce Risk Management

Translated: 2026/2/14 8:07:12

Japanese Translation

高度な攻撃状態での広範多様なデータの連携と結びつきを検討する伝統的な手作業的調査は労力を要し、効率性を制限します。大規模言語モデル（LLM）がこれらの分析を自動化する可能性があることを示わす一方、リスクシナリオの複雑さと深い専門知識の希少性からデプロイメントは困難です。これら2つの課題に対処するため、我々はSherlockという包括的なフレームワークを開発しました。これは構造化された分野での知恵を集約した知識ベース(KB)を組み込んだLLMによる推理と結びつけた3つの核心的なモジュールを通じて構築されています。最初に、特定の知識源によって多様な知識ソースから構成される専門知識に基づいて知的所有権を構築します。第二にはケース調査のために改良された2フェーズのリtrieval-augmented生成戦略設計されており、入力の文脈補助、リフレクション・アンド・リファインーモジュールによりKBを完全に利用して分析品質向上につなげます。最終的には運営と注釈プラットフォームを開発し、データフライウェイジに対して自律的なモデルのアップデートを通じて継続的なシステムの進化を促します。KBの更新とともにリアルタイムのホットファクスが組み込まれた定期的ロジックシート修正を通じて、攻撃状態からの driftに対応するための補給が可能です。JD・ドット・コムでのオンラインA/Bテストにより、Sherlockは82%のexpert acceptace rate（EAR）と日間調査処理速度の386.7倍を達成しました。追加90日間の評価でも、モデル自動アップデートを通じて性能損失が再び進行したときの効果があり、EARは約3. 5%上昇しています。

Original Content

arXiv:2510.08948v3 Announce Type: replace-cross Abstract: Effective e-commerce risk management requires in-depth case investigations to identify emerging fraud patterns in highly adversarial environments. However, manual investigation typically requires analyzing the associations and couplings among multi-source heterogeneous data, a labor-intensive process that limits efficiency. While Large Language Models (LLMs) show promise in automating these analyses, their deployment is hindered by the complexity of risk scenarios and the sparsity of long-tail domain knowledge. To address these challenges, we propose Sherlock, a framework that integrates structured domain knowledge with LLM-based reasoning through three core modules. First, we construct a domain Knowledge Base (KB) by distilling structured expertise from heterogeneous knowledge sources. Second, we design a two-stage retrieval-augmented generation strategy tailored for case investigation, which combines input contextual augmentation with a Reflect & Refine module to fully leverage the KB for improved analysis quality. Finally, we develop an integrated platform for operations and annotation to drive a self-evolving data flywheel. By combining real-time hotfixes through KB updates with periodic logic alignment via post-training, we facilitate continuous system evolution to counteract adversarial drifts. Online A/B tests at JD dot com demonstrate that Sherlock achieves an 82% Expert Acceptance Rate (EAR) and a 386.7% increase in daily investigation throughput. An additional 90-day evaluation shows that the flywheel successfully recovers from performance decay caused by changing tactics twice, raising the EAR ceiling by around 3.5% through autonomous model updates.