AIDB Daily Papers

LLMにおけるアブダクション（仮説形成）推論：統一的な分類と調査

原題: Wiring the 'Why': A Unified Taxonomy and Survey of Abductive Reasoning in LLMs

著者: Moein Salimi, Shaygan Adim, Danial Parnian, Nima Alighardashi, Mahdi Jafari Siavoshani, Mohammad Hossein Rohban

公開日: 2026-04-09 | 分野: 分析ニューラルネットワーク大規模言語モデル英語深層学習論文調査自然言語処理タスク日本語モデル LLM テキスト言語評価知識 AI 機械学習推論ベンチマークデータセット Transformer

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

大規模言語モデル（LLM）におけるアブダクション推論の初の包括的な調査研究を行った。
哲学的な基礎から最新のAI実装までを辿り、分野に蔓延る概念的な混乱とタスク定義のずれに対処する。
仮説生成と仮説選択の2段階定義を確立し、既存研究を分類整理、LLMの推論能力のギャップを明らかにした。

Abstract

Regardless of its foundational role in human discovery and sense-making, abductive reasoning--the inference of the most plausible explanation for an observation--has been relatively underexplored in Large Language Models (LLMs). Despite the rapid advancement of LLMs, the exploration of abductive reasoning and its diverse facets has thus far been disjointed rather than cohesive. This paper presents the first survey of abductive reasoning in LLMs, tracing its trajectory from philosophical foundations to contemporary AI implementations. To address the widespread conceptual confusion and disjointed task definitions prevalent in the field, we establish a unified two-stage definition that formally categorizes prior work. This definition disentangles abduction into textit{Hypothesis Generation}, where models bridge epistemic gaps to produce candidate explanations, and textit{Hypothesis Selection}, where the generated candidates are evaluated and the most plausible explanation is chosen. Building upon this foundation, we present a comprehensive taxonomy of the literature, categorizing prior work based on their abductive tasks, datasets, underlying methodologies, and evaluation strategies. In order to ground our framework empirically, we conduct a compact benchmark study of current LLMs on abductive tasks, together with targeted comparative analyses across model sizes, model families, evaluation styles, and the distinct generation-versus-selection task typologies. Moreover, by synthesizing recent empirical results, we examine how LLM performance on abductive reasoning relates to deductive and inductive tasks, providing insights into their broader reasoning capabilities. Our analysis reveals critical gaps in current approaches--from static benchmark design and narrow domain coverage to narrow training frameworks and limited mechanistic understanding of abductive processes...

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.08016
カテゴリ: cs.AI, cs.LG

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報