AIDB Daily Papers

Web上の間接プロンプトインジェクション：その実態、手法、目的の実証的研究

原題: Indirect Prompt Injection in the Wild: An Empirical Study of Prevalence, Techniques, and Objectives

著者: Soheil Khodayari, Xuenan Zhang, Bhupendra Acharya, Giancarlo Pellegrino

公開日: 2026-04-29 | 分野: LLM セキュリティ AI 自然言語処理 cs.CR 信頼性

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

Webページを信頼できない入力元として、LLMの動作を不正に操作する間接プロンプトインジェクションの実態を大規模に調査した。
Web上の間接プロンプトインジェクションは既に蔓延しており、AIボット検出やコンテンツ保護など多様な目的で利用されている。
実験の結果、LLMは間接プロンプトインジェクションの影響を限定的だが無視できない程度に受けることが示された。

Abstract

As LLMs are increasingly integrated into systems that browse, retrieve, summarize, and act on web content, webpages have become an untrusted input vector for downstream model behavior. This enables site owners, contributors, and adversaries to embed instructions directly in web resources, i.e., indirect prompt injections. While prior work demonstrates such attacks in controlled settings, their prevalence, deployment, and real-world impact remain unclear. We present one of the first large-scale empirical analyses of indirect prompt injections in webpages and HTTP responses. Analyzing 1.2B URLs from 24.8M hosts, we identify 15.3K validated instances across 11.7K pages. These are not isolated cases: a small number of recurring templates account for most cases. We characterize their objectives, delivery mechanisms, visibility, persistence, and impact, revealing a heterogeneous ecosystem spanning disruptive prompts, reputation manipulation, content-protection directives, and AI-bot detection, targeting systems such as crawlers, search pipelines, customer-support agents, and hiring workflows. A key finding is that most instructions target machines rather than humans: about 70% appear in non-rendered HTML (e.g., headers, comments, metadata), and many visible cases are hidden via rendering techniques. To assess practical risk, we run 5,200 controlled experiments across 13 models and four webpage representations. Our results show compliance is limited but non-negligible, reaching up to 8% for smaller models on plain-text inputs, while structured representations reduce compliance by preserving structural cues. Overall, prompt-based interference is already present in the web ecosystem and represents a growing source of tension between LLM-driven automation and the sites it consumes.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.27202
カテゴリ: cs.CR

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報