AIDB Daily Papers
ソフトプロンプトでLLMの幻覚を軽量に抑制する試み
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- ソフトプロンプトを用いて、大規模言語モデルの幻覚(事実に基づかないもっともらしい応答)を抑制する手法を提案しました。
- 本研究は、幻覚抑制、不確実な場合の回答回避、事実の想起をバランスさせることで、LLMの信頼性を高めることを目指しています。
- 実験の結果、提案手法は既存手法と比較して、パラメータの大部分を訓練することなく、幻覚抑制と事実想起のバランスを改善しました。
Abstract
Large language models (LLMs) have seen widespread adoption across various domains, yet their reliability is frequently undermined by hallucinations - responses that are plausible-sounding but factually incorrect. In high-stakes domains, these errors can reduce trust and introduce real-world risk. To address this challenge, we present a parameter-efficient approach that uses soft prompts to mitigate hallucinated content and promote responsible abstention in generative question-answering (QA) tasks. Our method, called Responsible Contrastive Soft Prompting (RCSP), uses a composite loss to train soft prompts that balance three goals: suppressing hallucinatory content, encouraging abstention under uncertainty, and preserving or improving factual recall. To achieve these goals, we incorporate contrastive loss, curriculum learning, and KL regularization into our training mechanism. We evaluate our approach on five diverse generative QA datasets using an LLM-as-a-Judge framework. Experimental results on the Gemma 3 (12B) and Llama 3.1 (8B) backbones demonstrate that RCSP effectively balances factual recall with hallucination suppression and abstention, yielding a generally superior F-score over standard reasoning and instruction-based prompting baselines. Notably, these improvements are achieved by training only a fraction of the parameters required by other tuning techniques. Our results demonstrate that soft prompts provide a modular and computationally efficient path toward improving LLM reliability.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: