AIDB Daily Papers
LLMの不確実性を人間の意見の相違と一致させる:主観性分析における新たなアプローチ
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 本研究は、人間の判断のばらつきを無視する従来のLLM訓練法に代わる、不確実性を考慮した主観性分析を提案する。
- 提案手法DPUAは、意見の相違を感知し、モデルの不確実性を人間の意見の分布に一致させることで、過信を抑制する。
- 実験の結果、DPUAは主観性分析タスクの性能を維持しつつ、モデルの不確実性と人間の意見の相違との一致を改善した。
Abstract
Large language models for subjectivity analysis are typically trained with aggregated labels, which compress variations in human judgment into a single supervision signal. This paradigm overlooks the intrinsic uncertainty of low-agreement samples and often induces overconfident predictions, undermining reliability and generalization in complex subjective settings. In this work, we advocate uncertainty-aware subjectivity analysis, where models are expected to make predictions while expressing uncertainty that reflects human disagreement. To operationalize this perspective, we propose a two-phase Disagreement Perception and Uncertainty Alignment (DPUA) framework. Specifically, DPUA jointly models label prediction, rationale generation, and uncertainty expression under an uncertainty-aware setting. In the disagreement perception phase, adaptive decoupled learning enhances the model's sensitivity to disagreement-related cues while preserving task performance. In the uncertainty alignment phase, GRPO-based reward optimization further improves uncertainty-aware reasoning and aligns the model's confidence expression with the human disagreement distribution. Experiments on three subjectivity analysis tasks show that DPUA preserves task performance while better aligning model uncertainty with human disagreement, mitigating overconfidence on boundary samples, and improving out-of-distribution generalization.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: