AIDB Daily Papers
思考の語り:大規模言語モデルにおける論破可能な倫理的推論のための推論時足場
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 思考の連鎖(CoT)におけるステークホルダーの崩壊と不確実性の抑制という2つの失敗モードを解決する新しいプロンプト「思考の語り(NoT)」を提案した。
- NoTは、学習やファインチューニングなしに、ステークホルダーの特定と不確実性の明示を構造化し、倫理的推論の質を大幅に向上させる点で重要である。
- NoTはステークホルダーの崩壊を31%から1%未満に、不確実性の抑制を72%から1-24%に削減し、多者間議論では合意形成率を95%以上に向上させた。
Abstract
Standard chain-of-thought on moral dilemmas exhibits two failure modes: stakeholder collapse (the trace names at most one party with a stake in the outcome) and uncertainty suppression (no explicit unknowns or hedges before committing to an action). We introduce narration-of-thought (NoT), a system prompt that structures chain-of-thought into five sections: protagonist, stakeholders, two-step consequences, uncertainty, then commitment. NoT adds no training, parameters, or fine-tuning. On 100 DailyDilemmas scenarios across four generators from three vendors, NoT cuts stakeholder collapse from up to 31% to under 1% and uncertainty suppression from up to 72% to 1-24% on every model. A matched-budget verbose-CoT control rules out token spend as the active ingredient; NoT retains Cliff's delta advantages of +0.79 to +0.90 on stakeholder count and +0.65 to +0.93 on uncertainty score for three of four generators, and a section ablation attributes each shift to its specific sub-instruction. Textual-gradient descent initialised at NoT improves the scaffold further; a cross-family training judge (different vendor from the generator) dominates an in-family one on every measured axis. Extended to a five-round multi-stakeholder debate protocol, the scaffold converts a 6% standoff into 95% full consensus on a calibration set and 100% combined convergence on a DailyDilemmas replication. The resulting traces externalise the stakeholders, consequences, and uncertainty grounding each commitment, providing an auditable substrate for dependable agentic deployment.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: