AIDB Daily Papers
状況に合わせた性格操作:大規模言語モデルの新たな可能性
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 大規模言語モデルの性格を、状況に応じて柔軟に変化させる手法を開発しました。
- 既存手法の制御性と資源効率の限界を克服し、状況依存性と行動パターンに着目した点が新しいです。
- 実験の結果、IRISフレームワークが既存手法を凌駕し、汎用性とロバスト性を示すことができました。
Abstract
Personalized Large Language Models (LLMs) facilitate more natural, human-like interactions in human-centric applications. However, existing personalization methods are constrained by limited controllability and high resource demands. Furthermore, their reliance on static personality modeling restricts adaptability across varying situations. To address these limitations, we first demonstrate the existence of situation-dependency and consistent situation-behavior patterns within LLM personalities through a multi-perspective analysis of persona neurons. Building on these insights, we propose IRIS, a training-free, neuron-based Identify-Retrieve-Steer framework for advanced situational personality steering. Our approach comprises situational persona neuron identification, situation-aware neuron retrieval, and similarity-weighted steering. We empirically validate our framework on PersonalityBench and our newly introduced SPBench, a comprehensive situational personality benchmark. Experimental results show that our method surpasses best-performing baselines, demonstrating IRIS's generalization and robustness to complex, unseen situations and different models architecture.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: