AIDB Daily Papers

LLMの幻覚（ハルシネーション）をほぼ特徴づける「イノベーション」

原題: Innovation: An Almost Characterization of Hallucination

著者: Nishant P. Das, Piyush Srivastava

公開日: 2026-05-26 | 分野: LLM cs.AI cs.LG 幻覚 AI安全性

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

LLMの幻覚を理解するため、新たな指標「イノベーション」を定義し、その性質を分析した。
イノベーションは、モデルが訓練データ外の出力を生成する傾向を測り、幻覚の発生と密接に関連する。
イノベーション率に基づき、訓練データの不完全さ（ミッシングマス）から生じる幻覚の下限値を新たな手法で導出した。

Abstract

Hallucination is a central limitation of large language models (LLMs), and substantial effort has been devoted to understanding and mitigating it. Towards this, Kalai and Vempala (STOC 2024) introduced a probabilistic framework formalizing calibration and hallucination, and showed that, with high probability, calibrated LLMs hallucinate roughly at the rate of the "missing mass", a measure of how incomplete the training data is relative to its source. This raises two fundamental questions: (i) what property of a calibrated LLM makes hallucinations unavoidable? and (ii) can hallucinations be avoided by giving up calibration? We answer these questions by introducing a simpler property we call innovation that measures the tendency of a model to produce outputs outside the training data. We show that innovation is implied by the condition for hallucination identified by Kalai and Vempala, and, further, that it is an almost characterization of hallucination: hallucination implies innovation, and conversely, innovation implies hallucination with high probability. We also provide lower bounds on the hallucination rate based on the "innovation rate", and by relating innovation rate back to missing mass, we obtain new hallucination rate lower bounds based on missing mass that extend the results of Kalai and Vempala.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.26808
カテゴリ: cs.LG, cs.AI, cs.IT

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報