arxiv_cs_ai 2026年2月10日

グラフを駆使した深層 reinforcement learning の応用：マルチオブジェクト付き並行マシンシミュレーション

Graph-Enhanced Deep Reinforcement Learning for Multi-Objective Unrelated Parallel Machine Scheduling

Translated: 2026/3/7 9:49:13

reinforcement-learningmachine-schedulingmulti-objectivegraph-neural-networkdeep-learning

Japanese Translation

複雑な出番、セットアップ時間の要素が含まれる不受影響並列機器のスケジューリング上の問題（UPMSP）を対象に、この研究では多目的最適化を達成するためDeep reinforcement learning技術を使用しています。PPOプロキシ政策最適化とGNN（グラフニューラルネットワーク）が深層の学習を行い、その複雑な状態であるジョブやマシン、セットアップ時間を把握します。多目的報酬関数を導くことでそのアルゴリズムは両方の目標を同時に最小限に抑えることができます。

Original Content

arXiv:2602.08052v1 Announce Type: new Abstract: The Unrelated Parallel Machine Scheduling Problem (UPMSP) with release dates, setups, and eligibility constraints presents a significant multi-objective challenge. Traditional methods struggle to balance minimizing Total Weighted Tardiness (TWT) and Total Setup Time (TST). This paper proposes a Deep Reinforcement Learning framework using Proximal Policy Optimization (PPO) and a Graph Neural Network (GNN). The GNN effectively represents the complex state of jobs, machines, and setups, allowing the PPO agent to learn a direct scheduling policy. Guided by a multi-objective reward function, the agent simultaneously minimizes TWT and TST. Experimental results on benchmark instances demonstrate that our PPO-GNN agent significantly outperforms a standard dispatching rule and a metaheuristic, achieving a superior trade-off between both objectives. This provides a robust and scalable solution for complex manufacturing scheduling.