arxiv_cs_ai 2026年4月24日

RV-HATE: 明示的否定的嫌悪発言検出を強化するマルチモジュール投票システム

RV-HATE: Reinforced Multi-Module Voting for Implicit Hate Speech Detection

Translated: 2026/4/24 20:32:54

reinforcement-learninghate-speech-detectionmulti-module-votingnatural-language-processingarxiv

Japanese Translation

arXiv:2510.10971v2 発表形式：replace-cross 要約: 嫌悪発言は依然として人間社会に存在し、その形態や表現は進化し続けています。インターネットの進歩とオンラインのアノニミティは、その急速な拡散を加速させ、検出を複雑にしています。しかし、嫌悪発言のデータセットは、異なるソースやプラットフォームから構成されているため、それぞれが異なる言語的スタイルや社会的文脈を反映しているという多様な特性を示しています。この多様性にもかかわらず、従来の嫌悪発言検出の研究は、データ固有の特性に適応せずに固定的な手法に依存していることが多くあります。我々は、各嫌悪発言データセット固有の特性を考慮하도록設計された検出フレームワーク RV-HATE を提案します。RV-HATE は、多岐にわたる専門的なモジュールから構成されており、各モジュールは嫌悪発言の異なる言語的または文脈的機能に焦点を当てています。このフレームワークは、強化学習を活用して、特定のデータセットにおける各モジュールの寄与度を決定する重み进行最適化します。その後、投票機構がモジュールの出力を統合して最終的な判断を下します。RV-HATE は、(1) データセット固有の属性に検出プロセスを調整することで検出精度を向上させ、(2) 各データセットの固有の特徴に関する解釈可能な洞察を提供するという 2 つの主要な利点を有します。したがって、我々のアプローチは明示的・否定的の嫌悪発言を効果的に対処し、従来の静的な方法と比較して優れた性能を実現しています。当社のコードは、https://github.com/leeyejin1231/RV-HATE から利用可能です。

Original Content

arXiv:2510.10971v2 Announce Type: replace-cross Abstract: Hate speech remains prevalent in human society and continues to evolve in its forms and expressions. Modern advancements in internet and online anonymity accelerate its rapid spread and complicate its detection. However, hate speech datasets exhibit diverse characteristics primarily because they are constructed from different sources and platforms, each reflecting different linguistic styles and social contexts. Despite this diversity, prior studies on hate speech detection often rely on fixed methodologies without adapting to data-specific features. We introduce RV-HATE, a detection framework designed to account for the dataset-specific characteristics of each hate speech dataset. RV-HATE consists of multiple specialized modules, where each module focuses on distinct linguistic or contextual features of hate speech. The framework employs reinforcement learning to optimize weights that determine the contribution of each module for a given dataset. A voting mechanism then aggregates the module outputs to produce the final decision. RV-HATE offers two primary advantages: (1)~it improves detection accuracy by tailoring the detection process to dataset-specific attributes, and (2)~it also provides interpretable insights into the distinctive features of each dataset. Consequently, our approach effectively addresses implicit hate speech and achieves superior performance compared to conventional static methods. Our code is available at https://github.com/leeyejin1231/RV-HATE.