arxiv_cs_lg 2026年2月10日

Neural TSP 表現を調べる：推定型意思決定支援のための応用可能性

Probing Neural TSP Representations for Prescriptive Decision Support

Translated: 2026/3/15 13:05:00

neural-combinatorial-optimizationtraveling-salesperson-problemprescriptive-analyticstransfer-learningdeep-probing

Japanese Translation

arXiv:2602.07216v1 発表タイプ：新しい摘要：神経組合せ最適化（NCO）の分野では、旅行商問題（TSP）などの NP 困難問題を解くために神経ポリシーが訓練されています。私たちは、良いルートを生成するだけでなく、訓練された TSP ソルバーが他の最適化に関連する目的に転移可能な内部表現を学習するか、という点について検討します。これは他の領域からの転移学習の精神に合致します。我々はいくつかのアテンションベースの TSP ポリシーを訓練し、その内部アクティベーションを収集し、実際の物流シナリオに着想を得た 2 つの NP 困難な下流推定型タスク（ノード除去感受性：最も影響のあるノードを特定し、エッジ禁止感受性：最も重要なエッジを特定する）に対してノード/エッジ埋め込み探针を訓練しました。ユークリッド TSP100 で訓練されたモデルについて、両方のタスクの探针結果は既存のベースラインと競争力があります。幾何学的特徴と探针信号をアンサンブル化することで、最も強固なベースラインを超えることができます：最も良いノード除去タスクに対して 65% のトップ 1 精度（ベースライン 58% 対比）とし、最悪のエッジ識別タスクに対して 73% のトップ 1 精度（ベースライン 67% 対比）を得ました。我々の知る限り、これはルートの構築を超える推定型「もし何か」意思決定支援の目的のために転移可能なエンコーダーとして神経 TSP ソルバーを研究する最初の試みです。最後に、我々は転移精度がソルバーの質と訓練、モデル規模とともに増加することを示し、より強力な NCO ソルバーを訓練することは下流の目的にとってより有用なエンコーダーを生み出すことを示唆しています。当社のコードは利用可能です：github.com/ReubenNarad/tsp_prescriptive_probe

Original Content

arXiv:2602.07216v1 Announce Type: new Abstract: The field of neural combinatorial optimization (NCO) trains neural policies to solve NP-hard problems such as the traveling salesperson problem (TSP). We ask whether, beyond producing good tours, a trained TSP solver learns internal representations that transfer to other optimization-relevant objectives, in the spirit of transfer learning from other domains. We train several attention-based TSP policies, collect their internal activations, and train probes on node/edge embeddings for two NP-hard prescriptive downstream tasks inspired by real-world logistics scenarios: node-removal sensitivity (identifying the most impactful node to remove) and edge-forbid sensitivity (identifying the most critical edge to retain). On a Euclidean TSP100-trained model, probes for both tasks are competitive with existing baselines. Ensembling probe signals with geometric features outperforms the strongest baselines: 65\% top-1 accuracy (vs. 58\% baseline) for the best-node-removal task, and 73\% top-1 accuracy (vs. 67\% baseline) for the worst-edge identification task. To our knowledge, we are the first to study neural TSP solvers as transferable encoders for prescriptive what-if decision-support objectives beyond tour construction. Finally, we show that transfer accuracy increases with solver quality across training and model scale, suggesting that training stronger NCO solvers also yields more useful encoders for downstream objectives. Our code is available at: github.com/ReubenNarad/tsp_prescriptive_probe