AIDB Daily Papers
LLMの幻覚(ハルシネーション)をほぼ特徴づける「イノベーション」
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- LLMの幻覚を理解するため、新たな指標「イノベーション」を定義し、その性質を分析した。
- イノベーションは、モデルが訓練データ外の出力を生成する傾向を測り、幻覚の発生と密接に関連する。
- イノベーション率に基づき、訓練データの不完全さ(ミッシングマス)から生じる幻覚の下限値を新たな手法で導出した。
Abstract
Hallucination is a central limitation of large language models (LLMs), and substantial effort has been devoted to understanding and mitigating it. Towards this, Kalai and Vempala (STOC 2024) introduced a probabilistic framework formalizing calibration and hallucination, and showed that, with high probability, calibrated LLMs hallucinate roughly at the rate of the "missing mass", a measure of how incomplete the training data is relative to its source. This raises two fundamental questions: (i) what property of a calibrated LLM makes hallucinations unavoidable? and (ii) can hallucinations be avoided by giving up calibration? We answer these questions by introducing a simpler property we call innovation that measures the tendency of a model to produce outputs outside the training data. We show that innovation is implied by the condition for hallucination identified by Kalai and Vempala, and, further, that it is an almost characterization of hallucination: hallucination implies innovation, and conversely, innovation implies hallucination with high probability. We also provide lower bounds on the hallucination rate based on the "innovation rate", and by relating innovation rate back to missing mass, we obtain new hallucination rate lower bounds based on missing mass that extend the results of Kalai and Vempala.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: