arxiv_cs_ai 2026年2月10日

シーケンスとしてのノード：対応型マルチモーダリティ・グラフ推薦

Sequences as Nodes for Contrastive Multimodal Graph Recommendation

Translated: 2026/3/7 12:24:48

machine-learninggraph-recommendationmultimodal-data-analysissequence-models

Japanese Translation

複数種類の材料が混在している推薦システムにおいて、『冷たくなった初歩』やデータ不足に直面する問題を解決するために、多くのマルチモーダル、シーケンシャル（順序的な）そして対応型の技術が提案されています。しかし、これらの強化は推奨性能を向上させる一方で、冗長さや有用な意味を持たないノイズを持ってきました。これを解決するために、私たちの研究では、MU-SIC-REC（マルチモーダルシーケンス・アイテム対応型推奨機）と名付けられたマルチビュー・グラフベースの推薦モデルを提案しました。このシステムは複数の観点からの協力、シーケンシャル、そしてマルチモーダル情報を組み合わせて推薦を行うことを可能にします。まず、利用者が触れたアイテム間の順序に対して、注目パッロングという手法により、序列のノードを作ります。さらなる観点として、序列的な背景情報信号を注入し、同時に、序列的背景情報と共にさらに多模態（文字と画像などの多形種）の情報を組み込むために設計されます。さらに、各モーダルが提供するテキストや画像情報への重視度を管理するために、それぞれのコンテンツとIDを使用したゲートが適用されます。この方法によって、多模態データからなるノイズだけでなく、データ間の整合性もより良く保つことができます。我々は、異なるモデルタイプに対して実施された評価により、Amazon Baby、スポーツ用品、および電子機器でのテストに使用するデータセットを対象にしました。結果的に、MU-SIC-RECはその他の同期的、マルチモーダルと対応型の基準に対しては、最上位でした。我々のコードはhttps://anonymous.4open.science/r/MuSICRec-3CEE/でアクセス可能です。

Original Content

arXiv:2602.07208v1 Announce Type: cross Abstract: To tackle cold-start and data sparsity issues in recommender systems, numerous multimodal, sequential, and contrastive techniques have been proposed. While these augmentations can boost recommendation performance, they tend to add noise and disrupt useful semantics. To address this, we propose MuSICRec (Multimodal Sequence-Item Contrastive Recommender), a multi-view graph-based recommender that combines collaborative, sequential, and multimodal signals. We build a sequence-item (SI) view by attention pooling over the user's interacted items to form sequence nodes. We propagate over the SI graph, obtaining a second view organically as an alternative to artificial data augmentation, while simultaneously injecting sequential context signals. Additionally, to mitigate modality noise and align the multimodal information, the contribution of text and visual features is modulated according to an ID-guided gate. We evaluate under a strict leave-two-out split against a broad range of sequential, multimodal, and contrastive baselines. On the Amazon Baby, Sports, and Electronics datasets, MuSICRec outperforms state-of-the-art baselines across all model types. We observe the largest gains for short-history users, mitigating sparsity and cold-start challenges. Our code is available at https://anonymous.4open.science/r/MuSICRec-3CEE/ and will be made publicly available.