AIDB Daily Papers

LLMにおける感情推論のメカニズム解明：構文から感情へ

原題: From Syntax to Emotion: A Mechanistic Analysis of Emotion Inference in LLMs

著者: Bangzhao Shu, Arinjay Singh, Mai ElSherief

公開日: 2026-04-28 | 分野: LLM NLP 解釈性感情自然言語処理 XAI 大規模言語モデル cs.CL

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

本研究は、LLMが感情をどのように認識するか、その内部メカニズムをスパースオートエンコーダーを用いて解析した。
感情認識は最終段階で顕著になり、感情共有特徴と特定特徴から構成され、感情ごとに表現の強さや影響力が異なることを発見した。
因果的特徴操作手法を提案し、LLMの感情認識性能を向上させつつ、言語モデリング能力を維持することに成功した。

Abstract

Large language models (LLMs) are increasingly used in emotionally sensitive human-AI applications, yet little is known about how emotion recognition is internally represented. In this work, we investigate the internal mechanisms of emotion recognition in LLMs using sparse autoencoders (SAEs). By analyzing sparse feature activations across layers, we identify a consistent three-phase information flow, in which emotion-related features emerge only in the final phase. We further show that emotion representations comprise both shared features across emotions and emotion-specific features. Using phase-stratified causal tracing, we identify a small set of features that strongly influence emotion predictions, and show that both their number and causal impact vary across emotions; in particular, Disgust is more weakly and diffusely represented than other emotions. Finally, we propose an interpretable and data-efficient causal feature steering method that significantly improves emotion recognition performance across multiple models while largely preserving language modeling ability, and demonstrate that these improvements generalize across multiple emotion recognition datasets. Overall, our findings provide a systematic analysis of the internal mechanisms underlying emotion recognition in LLMs and introduce an efficient, interpretable, and controllable approach for improving model performance.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.25866
カテゴリ: cs.CL

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報