arxiv_cs_cv 2026年4月20日

脆弱データ主体から被害を生むデータ実践へ：プラットフォーム化された人生の AI 分析における保護の矛盾への導航

From Vulnerable Data Subjects to Vulnerabilizing Data Practices: Navigating the Protection Paradox in AI-Based Analyses of Platformized Lives

Open original article

Translated: 2026/4/20 10:48:23

artificial-intelligencedata-ethicsplatform-societydata-vulnerabilityai-for-good

Japanese Translation

arXiv:2604.15990v1 発表タイプ: クロス要旨：本研究論文は、データ主体の脆弱性を静的な、本質化した特徴として理解する概念の転換を辿り、それがデータ実践を通じて能動的に実行されている方法を検証します。反復的倫理枠組みが欠如や反データに焦点を当てるのに対し、私たちは、ほぼ耗尽しないデータポイントの量に already 存在するプラットフォーム化された人生に内在する豊富さの条件に対して対処し、研究倫理上の課題をこの既存の大量データに対する研究家の選択へとシフトさせます。我々は、データサイエンスの倫理的正しさが、誰が研究対象となるだけでなく、「脆弱」な個人がデータ主体として再構成され、その脆弱性がさらに不安定化されるように技術パイプラインがどのように変換するかという点にも依存すると主張します。この主張は、AI for Social Good (AI4SG) のケーススタディを通じて発展させます：ジャーナリストが、規制上のアドボカシーのために、収益化された YouTube の「家族 Vlog」における子供の存在を定量化するためにコンピュータビジョンを活用することを要求したケースです。このケースは「保護の矛盾」を明らかにします：脆弱な主体を保護しようとするデータ驱动の努力が、意図せず新しい形態の計算上の暴露、還元主義、および抽出を課すという点です。この要求を出発点として、我々は AI パイプライン的方法論的分解を行い、粒度の高い技術的決定が倫理的に構成的な方法を示します。我々は、これらの洞察を研究倫理の反射的なロードマップへと翻訳する反射的倫理のプロトコルに貢献します。このプロトコルは、データセットの設計、運用化、推論、および伝達という 4 つの重要な接点を中心に据え、親切な意図の作業が再なる抽出や暴露へと滑り込む可能性のある技術的問題と倫理上の緊張関係を特定します。すべての決定点については、このプロトコルは暴露、収益化、物語の固定化、およびアルゴリズムの最適化という 4 つの横断する脆弱化因子をナビゲートするための具体的なプロンプトを提供します。批判的でない...という形で

Original Content

arXiv:2604.15990v1 Announce Type: cross Abstract: This paper traces a conceptual shift from understanding vulnerability as a static, essentialized property of data subjects to examining how it is actively enacted through data practices. Unlike reflexive ethical frameworks focused on missing or counter-data, we address the condition of abundance inherent to platformized life-a context where a near inexhaustible mass of data points already exists, shifting the ethical challenge to the researcher's choices in operating upon this existing mass. We argue that the ethical integrity of data science depends not just on who is studied, but on how technical pipelines transform "vulnerable" individuals into data subjects whose vulnerability can be further precarized. We develop this argument through an AI for Social Good (AI4SG) case: a journalist's request to use computer vision to quantify child presence in monetized YouTube 'family vlogs' for regulatory advocacy. This case reveals a "protection paradox": how data-driven efforts to protect vulnerable subjects can inadvertently impose new forms of computational exposure, reductionism, and extraction. Using this request as a point of departure, we perform a methodological deconstruction of the AI pipeline to show how granular technical decisions are ethically constitutive. We contribute a reflexive ethics protocol that translates these insights into a reflexive roadmap for research ethics surrounding platformized data subjects. Organized around four critical junctures-dataset design, operationalization, inference, and dissemination-the protocol identifies technical questions and ethical tensions where well-intentioned work can slide into renewed extraction or exposure. For every decision point, the protocol offers specific prompts to navigate four cross-cutting vulnerabilizing factors: exposure, monetization, narrative fixing, and algorithmic optimization. Rather than uncritically...