arxiv_cs_lg 2026年4月24日

SkillGraph: LLM エージェントのツールシーケンス推薦のためのグラフ基礎優先知識

SkillGraph: Graph Foundation Priors for LLM Agent Tool Sequence Recommendation

Translated: 2026/4/24 20:01:59

llm-agenttool-usegraph-priorsretrievalranking

Japanese Translation

arXiv:2604.19793v1 Announce Type: cross 抽象文: LLM エージェントは、大規模な API ライブラリーからツールを選択し、正しく順序付けなければなりません。既存の手法は、検索と順序付けの両方に半義類似性を利用していますが、順序付けはツール記述から欠如しているツール間のデータ依存性に依存します。その結果、半義だけの手法は構造化されたワークフロードメインにおいてネガティブな Kendall-$\tau$ を生み出します。我々は、49,831 回の成功した LLM エージェント軌跡から掘り起こされた有向付重実行遷移グラフ「SkillGraph」を紹介し、これは再利用可能なグラフ基礎優先知識としてワークフロー優先順位の規則性をエンコードします。このグラフ基礎優先知識に基づき、我々は候補選出のための GS-Hybrid 検索と、順序付けのための学習済みペアワイズリランカーから構成される 2 ステージデカップルドフレームワークを提案します。ToolBench（9,965 つのテストインスタンス；約 16,000 つのツール）では、Set-F1 = 0.271、Kendall-$\tau$ = 0.096 と達成しました。API-Bank において、Kendall-$\tau$ は -0.433 から +0.613 へと改善されました。同一の Stage-1 インプット条件下において、学習されたリランカーは LLaMA-3.1-8B Stage-2 リランカーにも優れています。

Original Content

arXiv:2604.19793v1 Announce Type: cross Abstract: LLM agents must select tools from large API libraries and order them correctly. Existing methods use semantic similarity for both retrieval and ordering, but ordering depends on inter-tool data dependencies that are absent from tool descriptions. As a result, semantic-only methods can produce negative Kendall-$\tau$ in structured workflow domains. We introduce SkillGraph, a directed weighted execution-transition graph mined from 49,831 successful LLM agent trajectories, which encodes workflow-precedence regularities as a reusable graph foundation prior. Building on this graph foundation prior, we propose a two-stage decoupled framework: GS-Hybrid retrieval for candidate selection and a learned pairwise reranker for ordering. On ToolBench (9,965 test instances; ~16,000 tools), the method reaches Set-F1 = 0.271 and Kendall-$\tau$ = 0.096; on API-Bank, Kendall-$\tau$ improves from -0.433 to +0.613. Under identical Stage-1 inputs, the learned reranker also outperforms LLaMA-3.1-8B Stage-2 rerankers.