AIDB Daily Papers

LLMの性能に対するプロンプト要素の影響を理解するための回帰分析フレームワーク

原題: A Regression Framework for Understanding Prompt Component Impact on LLM Performance

著者: Andrew Lauziere, Jonathan Daugherty, Taisa Kushner

公開日: 2026-03-27 | 分野: LLM 統計機械学習評価プロンプト分析モデル自然言語処理性能回帰分析 XAI

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

LLMの性能に影響を与えるプロンプトの特定要素を理解するための統計的フレームワークを提案した。
既存のXAI手法を拡張し、プロンプトの各部分とLLMの評価を関連付ける回帰モデルを構築することでLLMを詳細に分析する。
Mistral-7BとGPT-OSS-20Bに対し、算術問題のプロンプトにおける誤情報が性能を阻害することを発見した。

Abstract

As large language models (LLMs) continue to improve and see further integration into software systems, so does the need to understand the conditions in which they will perform. We contribute a statistical framework for understanding the impact of specific prompt features on LLM performance. The approach extends previous explainable artificial intelligence (XAI) methods specifically to inspect LLMs by fitting regression models relating portions of the prompt to LLM evaluation. We apply our method to compare how two open-source models, Mistral-7B and GPT-OSS-20B, leverage the prompt to perform a simple arithmetic problem. Regression models of individual prompt portions explain 72% and 77% of variation in model performances, respectively. We find misinformation in the form of incorrect example query-answer pairs impedes both models from solving the arithmetic query, though positive examples do not find significant variability in the impact of positive and negative instructions - these prompts have contradictory effects on model performance. The framework serves as a tool for decision makers in critical scenarios to gain granular insight into how the prompt influences an LLM to solve a task.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2603.26830
カテゴリ: cs.LG, cs.AI, cs.SE

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報