arxiv_cs_ai 2026年2月10日

DRAGOn: 定期的に更新されたコーパスに基づくRAGのデザイン

DRAGOn: Designing RAG On Periodically Updated Corpus

Translated: 2026/2/14 7:15:19

Original Content

arXiv:2507.05713v3 Announce Type: replace-cross Abstract: This paper introduces DRAGOn, method to design a RAG benchmark on a regularly updated corpus. It features recent reference datasets, a question generation framework, an automatic evaluation pipeline, and a public leaderboard. Specified reference datasets allow for uniform comparison of RAG systems, while newly generated dataset versions mitigate data leakage and ensure that all models are evaluated on unseen, comparable data. The pipeline for automatic question generation extracts the Knowledge Graph from the text corpus and produces multiple question-answer pairs utilizing modern LLM capabilities. A set of diverse LLM-as-Judge metrics is provided for a comprehensive model evaluation. We used Russian news outlets to form the datasets and demonstrate our methodology. We launch a public leaderboard to track the development of RAG systems and encourage community participation.