AIDB Daily Papers

質問に文書を合わせる：検索拡張生成のための質問指向文書書き換え

原題: Align Documents to Questions: Question-Oriented Document Rewriting for Retrieval-Augmented Generation

著者: Jiaang Li, Zhendong Mao, Quan Wang, Yuning Wan, Yongdong Zhang

公開日: 2026-04-19 | 分野: LLM NLP Transformer RAG cs.CL

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

検索拡張生成（RAG）において、LLMが生成された内容を優先し、検索された証拠を無視する問題を解決する手法を提案した。
質問のスタイルに合わせて文書を書き換えることで、LLMが事実に基づいた情報をより活用できるようにする。
提案手法QREAMは、既存のRAGパイプラインに組み込むことで、事実性と関連性のバランスを向上させ、性能を改善した。

Abstract

Retrieval-Augmented Generation (RAG) enhances the factuality of Large Language Models (LLMs) by incorporating retrieved documents and/or generated context. However, LLMs often exhibit a stylistic bias when presented with mixed contexts, favoring fluent but hallucinated generated content over factually grounded yet disorganized retrieved evidence. This phenomenon reveals that the utility of retrieved information is bottlenecked by its presentation. To bridge this gap, we propose QREAM, a style-controlled rewriter that aligns retrieved documents with a question-oriented style while preserving facts, better for LLM readers to utilize. Our framework consists of two stages: (1) QREAM-ICL, which uses stylistic seeds to guide iterative rewriting exploration; and (2) QREAM-FT, a lightweight student model distilled from denoised ICL outputs. QREAM-FT employs dual-criteria rejection sampling, filtering based on answer correctness and factual consistency to ensure high-quality supervision. QREAM seamlessly integrates into existing RAG pipelines as a plug-and-play module. Experiments demonstrate that QREAM consistently enhances advanced RAG pipelines, yielding up to 8% relative improvement with negligible latency overhead, effectively balancing question relevance with factual grounding.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.17325
カテゴリ: cs.CL

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報