AIDB Daily Papers
Fusian:LLMにおける性格特性のきめ細かい連続制御のためのMulti-LoRA融合
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 大規模言語モデルの性格制御において、連続的な特性強度を精密に調整するFusianという新しいフレームワークを提案した。
- 従来の離散的な性格分類ではなく、LoRAアダプターの動的な融合により、性格特性の連続的なマニフォールドを捉える点が新しい。
- Qwen3-14Bモデルでの実験により、Fusianは性格制御において高い精度を示し、ユーザー指定の特性強度への適合で既存手法を凌駕した。
Abstract
Large Language Models (LLMs) have demonstrated impressive capabilities in simulating diverse human behaviors and personalities. However, existing methods for personality control, which include prompt engineering and standard Supervised Fine-Tuning (SFT), typically treat personality traits as discrete categories (e.g., "Extroverted" vs. "Introverted"), lacking the ability to precisely control the intensity of a trait on a continuous spectrum. In this paper, we introduce Fusian, a novel framework for fine-grained, continuous personality control in LLMs. Fusian operates in two stages: (1) Trajectory Collection, where we capture the dynamic evolution of personality adoption during SFT by saving a sequence of LoRA adapters, effectively mapping the continuous manifold of a trait; and (2) RL-based Dynamic Fusion, where we train a policy network using Reinforcement Learning to dynamically compute mixing weights for these frozen adapters. By sampling from a Dirichlet distribution parameterized by the policy network, Fusian fuses multiple adapters to align the model's output with a specific numerical target intensity. Experiments on the Qwen3-14B model demonstrate that Fusian achieves high precision in personality control, significantly outperforming baseline methods in aligning with user-specified trait intensities.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: