arxiv_cs_ai 2026年4月24日

RealRoute: Retrieve-then-Verify パラダイムに基づく動的クエリルーティングシステム

RealRoute: Dynamic Query Routing System via Retrieve-then-Verify Paradigm

Translated: 2026/4/24 20:20:10

ragllmquery-routingretrievalopen-source

Japanese Translation

arXiv:2604.20860v1 発表タイプ：クロス要約：検索拡張生成（RAG）により外部知識で LLM をアンカーする成果は顕著ですが、異種化されたソース（例：専用データベース、グローバルコーパス、API）への適用は依然として大きな課題です。既存のアプローチは、通常「LLM アス・ア・ルーター（LLM としてルーティング）」を採用し、分解されたサブクエリを予測的に特定のソースへ分散させています。しかし、この戦略は異なるデータソースの意味的理解に過度に依存しており、ソース境界があいまいなケースではルーティングエラーを引き起こす傾向があります。本研究では、予測的なルーティングから堅牢な「Retrieve-then-Verify（検索しなぐら）」メカニズムへのパラダイム転換を導入する RealRoute システムを提案します。RealRoute は、並列的かつソース無關の検索を通じて証拠の完全性を確保し、その結果を検証して事実に基づいた答えを合成する、動的な検証器を用います。私たちのデモでは、ユーザーがリアルタイムの「再ルーティング」プロセスを視覚化し、複数の知識サイロを超えた検証チェーンを検索できます。実験により、RealRoute は多段 RAG 推論タスクにおいて予測基準を著しく凌駕したことが確認されました。RealRoute システムは、直感的な Web インターフェースを備えたオープンソースツールセットとして公開されており、コードは URL: https://github.com/Joseph1951210/RealRoute に利用可能です。

Original Content

arXiv:2604.20860v1 Announce Type: cross Abstract: Despite the success of Retrieval-Augmented Generation (RAG) in grounding LLMs with external knowledge, its application over heterogeneous sources (e.g., private databases, global corpora, and APIs) remains a significant challenge. Existing approaches typically employ an LLM-as-a-Router to dispatch decomposed sub-queries to specific sources in a predictive manner. However, this "LLM-as-a-Router" strategy relies heavily on the semantic meaning of different data sources, often leading to routing errors when source boundaries are ambiguous. In this work, we introduce RealRoute System, a framework that shifts the paradigm from predictive routing to a robust Retrieve-then-Verify mechanism. RealRoute ensures \textit{evidence completeness through parallel, source-agnostic retrieval, followed by a dynamic verifier that cross-checks the results and synthesizes a factually grounded answer}. Our demonstration allows users to visualize the real-time "re-routing" process and inspect the verification chain across multiple knowledge silos. Experiments show that RealRoute significantly outperforms predictive baselines in the multi-hop Rag reasoning task. The RealRoute system is released as an open-source toolkit with a user-friendly web interface. The code is available at the URL: https://github.com/Joseph1951210/RealRoute.