AIDB Daily Papers
ペルソナ・プロンプトはいつ本当に役立つのか?LLMにおける専門家役割注入の検索とメトリック分析
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 本研究では、LLMに専門家としての役割を与えるペルソナ・プロンプトの効果を、複数の評価指標を用いて詳細に分析した。
- ペルソナ・プロンプトは、回答の専門性を深める一方で、明瞭性を低下させるトレードオフが存在し、その効果は質問の種類や分野に依存することが明らかになった。
- ハイブリッドな役割検索手法は、埋め込み検索のみの手法よりも優れていたが、専門性と明瞭性のトレードオフを完全に解消するには至らなかった。
Abstract
Persona prompting is widely used to steer large language models, yet its practical value remains unclear. Prior work often evaluates persona prompting using aggregate scores, making it difficult to determine whether expert-role prompting consistently improves response quality or instead changes responses along different quality dimensions. We study this question through a controlled comparison of four prompting conditions across 1,140 open-ended questions spanning 38 expert roles and six domains: no role prompt, a generic domain-expert prompt, embedding-based role retrieval, and a hybrid retrieval method combining embedding search with LLM-based role selection. Aggregate results show only small overall differences between conditions. However, metric-level analysis reveals a consistent tradeoff that aggregate averages obscure: role prompting systematically increases expertise depth while reducing clarity. These effects are highly conditional rather than universal. Role prompting performs best on advisory questions and in domains such as medicine and psychology, where structured expert framing and risk communication are intrinsically valuable. In contrast, baseline prompting performs better on conceptual and explanatory questions in finance, legal, science, and technology domains, where concise plain-language explanation is more important. We further show that hybrid retrieval significantly improves over embedding-only role selection, although better role retrieval does not eliminate the broader expertise-depth versus clarity tradeoff. Overall, our findings suggest that persona prompting primarily reshapes response characteristics rather than broadly improving capability, and that multi-metric evaluation is necessary for understanding its effects.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: