AIDB Daily Papers

帰属理論に基づく推論における社会的バイアス評価のための日本語ベンチマーク

原題: A Japanese Benchmark for Evaluating Social Bias in Reasoning Based on Attribution Theory

著者: Taihei Shiotani, Masahiro Kaneko, Naoaki Okazaki

公開日: 2026-04-01 | 分野: LLM NLP データセットベンチマーク AI 評価心理社会倫理研究日本語バイアス

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

大規模言語モデルの公平性向上のため、特定の言語地域の文化的背景に根ざした社会的バイアスを評価する。
既存の日本語ベンチマークは英語データの翻訳に依存し、推論におけるバイアスを捉えきれていない点が課題である。
帰属理論に基づき、結論を固定しつつ、内集団と外集団への行動の帰属におけるバイアスを評価するデータセットを構築した。

Abstract

In enhancing the fairness of Large Language Models (LLMs), evaluating social biases rooted in the cultural contexts of specific linguistic regions is essential. However, most existing Japanese benchmarks heavily rely on translating English data, which does not necessarily provide an evaluation suitable for Japanese culture. Furthermore, they only evaluate bias in the conclusion, failing to capture biases lurking in the reasoning. In this study, based on attribution theory in social psychology, we constructed a new dataset, ``JUBAKU-v2,'' which evaluates the bias in attributing behaviors to in-groups and out-groups within reasoning while fixing the conclusion. This dataset consists of 216 examples reflecting cultural biases specific to Japan. Experimental results verified that it can detect performance differences across models more sensitively than existing benchmarks.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.00568
カテゴリ: cs.CL

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報