arxiv_cs_ai 2026年4月24日

Nemobot Games: 大規模言語モデルを用いたインタラクティブ学習のための戦略的 AI ゲームエージェントの作成

Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models

Translated: 2026/4/24 20:18:54

nemobotlarge-language-modelsai-game-playingreinforcement-learningagentic-environments

Japanese Translation

arXiv:2604.21896v1 Announce Type: new 摘要：本稿は、大規模言語モデル（LLM）を活用し、Claude Shannon のゲームプレイングマシン分類を拡張・実装するための新しいパラダイムを紹介する。このパラダイムの核となるのが、ユーザーが LLM パワーのゲームエージェントを作成、カスタマイズ、デプロイし、同時に AI 主導の戦略と関与できるインタラクティブなエージェントエンジニアリング環境である Nemobot である。Nemobot に統合された LLM ベースのチャットボットは、4 つの異なるゲームのカテゴリにわたってその能力を実証する。辞書型ゲームに対しては、状態とアクションのマッピングを効率的で一般化されたモデルに圧縮し、迅速な適応力を発揮する。厳密に解けるゲームにおいては、数学的推理を用いて最適な戦略を計算し、人間の読みやすい説明を生成する。ヒューリスティックベースのゲームにおいては、古典的なミニマックスアルゴリズム（例えば shannon1950chess を参照）の洞察とクラウドソーシングデータを組み合わせることで戦略を統合する。最後に、学習ベースのゲームにおいては、人間フィードバックと自己批評を組み合わせた強化学習を用い、試行錯誤と模倣学習を通じて戦略を反復的に精緻化する。Nemobot は、このフレームワークを、ツール拡張生成（tool-augmented generation）や戦略的ゲームエージェントのファインチューニングを試行錯誤できるプログラム可能な環境を提供することで増幅する。戦略ゲームからロールプレイングゲームに至るまで、Nemobot は、クラウドソーシング学習と人間の創造性を統合し、自身の論理を反復的に精緻化する自己プログラム化の形を AI エージェントが達成する方法を示す。これは、自己プログラム化 AI の長期的な目標への一歩を示すものである。

Original Content

arXiv:2604.21896v1 Announce Type: new Abstract: This paper introduces a new paradigm for AI game programming, leveraging large language models (LLMs) to extend and operationalize Claude Shannon's taxonomy of game-playing machines. Central to this paradigm is Nemobot, an interactive agentic engineering environment that enables users to create, customize, and deploy LLM-powered game agents while actively engaging with AI-driven strategies. The LLM-based chatbot, integrated within Nemobot, demonstrates its capabilities across four distinct classes of games. For dictionary-based games, it compresses state-action mappings into efficient, generalized models for rapid adaptability. In rigorously solvable games, it employs mathematical reasoning to compute optimal strategies and generates human-readable explanations for its decisions. For heuristic-based games, it synthesizes strategies by combining insights from classical minimax algorithms (see, e.g., shannon1950chess) with crowd-sourced data. Finally, in learning-based games, it utilizes reinforcement learning with human feedback and self-critique to iteratively refine strategies through trial-and-error and imitation learning. Nemobot amplifies this framework by offering a programmable environment where users can experiment with tool-augmented generation and fine-tuning of strategic game agents. From strategic games to role-playing games, Nemobot demonstrates how AI agents can achieve a form of self-programming by integrating crowdsourced learning and human creativity to iteratively refine their own logic. This represents a step toward the long-term goal of self-programming AI.