arxiv_cs_lg 2026年2月10日

Maximum Discrimination を用いたグラフに基づく半教師済み学習

Graph-based Semi-Supervised Learning via Maximum Discrimination

Translated: 2026/3/15 7:04:51

semi-supervised-learninggraph-representationsmaximum-discriminationmachine-learningdeep-learning

Japanese Translation

arXiv:2602.08042v1 Announce Type: cross 要約: 半教師済み学習（SSL）は、ラベル付けされたデータが豊富にあるが、ラベル付けされていないデータが豊富である状況で、高精度なモデルをトレーニングするという重要な課題に対処します。グラフベースの SSL（GSSL）は、グラフ表現を通じてデータ構造を捉えることで広く採用されたフレームワークとして台頭しました。ラベル伝播やラベル広延など、従来のグラフベース SSL 手法は、同一ラベルを持つ点が一様表現空間で近接するよう、低次元の表現を計算することを目的としています。これらの手法はしばしば効果的ですが、複雑なラベル分布を持つデータにおいては最適解とならないことがあります。本研究では、クラス間の区別が最大化される低次元の表現を計算する、グラフアプローチである AUC-spec を開発しました。この表現は、ラベル付けされた点を介して推定される ROC 曲線下の領域（AUC）を最大化するように最適化することで計算されます。我々は、積多様体モデル（product-of-manifold model）における当アプローチについて詳細な分析を提供し、AUC-spec に必要なラベル付けされた点の数はモデルパラメータに対して多項式的であることを示しました。実験において、AUC-spec はクラス間分離性とグラフ滑らかさをバランスさせると示されました。合成データセットおよび現実世界のデータセットで競合的な結果を示す一方、従来の手法および最先端の手法との計算効率の比較可能性を維持しています。

Original Content

arXiv:2602.08042v1 Announce Type: cross Abstract: Semi-supervised learning (SSL) addresses the critical challenge of training accurate models when labeled data is scarce but unlabeled data is abundant. Graph-based SSL (GSSL) has emerged as a popular framework that captures data structure through graph representations. Classic graph SSL methods, such as Label Propagation and Label Spreading, aim to compute low-dimensional representations where points with the same labels are close in representation space. Although often effective, these methods can be suboptimal on data with complex label distributions. In our work, we develop AUC-spec, a graph approach that computes a low-dimensional representation that maximizes class separation. We compute this representation by optimizing the Area Under the ROC Curve (AUC) as estimated via the labeled points. We provide a detailed analysis of our approach under a product-of-manifold model, and show that the required number of labeled points for AUC-spec is polynomial in the model parameters. Empirically, we show that AUC-spec balances class separation with graph smoothness. It demonstrates competitive results on synthetic and real-world datasets while maintaining computational efficiency comparable to the field's classic and state-of-the-art methods.