arxiv_cs_gr 2026年4月14日

オンライン教育プラットフォームにおけるユーザー満足度の予測：大規模言語モデルに基づくマルチモーダルレビューマイニング枠組み

Predicting User Satisfaction in Online Education Platforms: A Large Language Model Based Multi-Modal Review Mining Framework

Open original article

Translated: 2026/4/14 14:15:31

llmmultimodalmachine-learningsentiment-analysislearning-analytics

Japanese Translation

arXiv:2604.11723v1 Announce Type: new 要旨: オンライン教育プラットフォームは過去 10 年間にわたり爆発的な成長を遂げ、レビュー、評価、行動ログという形で膨大なユーザー生成コンテンツを生成しました。これらの異種的信号は、学習者の満足度を理解する上で画期的な機会をもたらしており、これはコースの定着率、エンゲージメント、および長期的な学習成果の決定的要因です。ただし、オンラインレビューの短小さ、ノイズ、文脈依存性、および多面的な性質ゆえに満足度を正確に予測することは依然として困難です。本稿では、プラットフォームレベルおよびコースレベルの両方の学習者の満足度を予測するために、統一された extbf{大規模言語モデル (LLM) に基づくマルチモーダル枠組み} を提案します。提案された枠組みは、以下の 3 つの補完的な情報源を統合します：(1) 潜在の主題構造を捕捉する短テキストのトピック分布、(2) 事前学習されたトランスフォーマーベースの言語モデルから学習された文脈化された感情表現、(3) 学習者の活動ログから派生した行動相互作用特性。これらの異種的信号は、混合回帰アーキテクチャ内で融合され、精度の高い満足度予測が生成されます。我々は、複数のパブリックプラットフォームから収集された大規模 MOOC レビューデータセットを対象として大規模な実験を行いました。実験結果は、提案された LLM に基づくマルチモーダル枠組みが、従来のテキストのみモデル、浅層感情基準、および単一モード回帰アプローチと一貫して優れていることを示しています。包括的なアブレーション研究により、主題の文法、深層感情表現、および行動分析を同時にモデル化する必要性がさらに検証されました。我々の発見は、大規模な文脈言語表現が学習分析を推進する上で重要な役割を果たしていることを示しており、プラットフォーム設計、カリキュラム改善、および個人化された推薦のための具体的な洞察を提供します。

Original Content

arXiv:2604.11723v1 Announce Type: new Abstract: Online education platforms have experienced explosive growth over the past decade, generating massive volumes of user-generated content in the form of reviews, ratings, and behavioral logs. These heterogeneous signals provide unprecedented opportunities for understanding learner satisfaction, which is a critical determinant of course retention, engagement, and long-term learning outcomes. However, accurately predicting satisfaction remains challenging due to the short length, noise, contextual dependency, and multi-dimensional nature of online reviews. In this paper, we propose a unified \textbf{Large Language Model (LLM)-based multi-modal framework} for predicting both platform-level and course-level learner satisfaction. The proposed framework integrates three complementary information sources: (1) short-text topic distributions that capture latent thematic structures, (2) contextualized sentiment representations learned from pretrained Transformer-based language models, and (3) behavioral interaction features derived from learner activity logs. These heterogeneous representations are fused within a hybrid regression architecture to produce accurate satisfaction predictions. We conduct extensive experiments on large-scale MOOC review datasets collected from multiple public platforms. The experimental results demonstrate that the proposed LLM-based multi-modal framework consistently outperforms traditional text-only models, shallow sentiment baselines, and single-modality regression approaches. Comprehensive ablation studies further validate the necessity of jointly modeling topic semantics, deep sentiment representations, and behavioral analytics. Our findings highlight the critical role of large-scale contextual language representations in advancing learning analytics and provide actionable insights for platform design, course improvement, and personalized recommendation.