AIDB Daily Papers
SkillGuard:AIエージェントのスキル利用を安全に管理する権限フレームワーク
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- AIエージェントのスキル利用におけるセキュリティとプライバシーのリスクを解消する権限フレームワークSkillGuardを提案した。
- 既存の防御策ではスキルファイルや個々のツール呼び出しの静的検査に留まっていたが、SkillGuardはスキルの意図と実行時の挙動を体系的に結びつける点で重要である。
- SkillGuardは315のスキルを用いた評価で、攻撃成功率を大幅に低減させつつ、有用なタスク実行能力を維持することを示した。
Abstract
Agent skills extend LLM agents with reusable instructions, scripts, tool bindings, and contextual dependencies. However, current skill ecosystems largely rely on trust-based loading and static inspection, leaving a gap between what a skill can inject into an agent's context and what it can cause the agent to do at runtime. This gap introduces new security and privacy risks, and existing defenses primarily inspect skill files statically or regulate individual tool calls, without systematically connecting a skill's declared intent with its runtime behavior. In this paper, we present SkillGuard, a skill-centric permission framework that treats skills as permission-bearing executable artifacts. SkillGuard introduces a dual-plane governance model that jointly regulates context influence and action side effects through skill manifests, runtime access control, user-mediated authorization, deny-by-default enforcement, capability inference, and behavior monitoring. We evaluate SkillGuard on 315 real-world skills and SkillInject. The permission taxonomy covers 99.76% of observed protected objects, and automated manifest generation reaches 91.0% F1. In adversarial evaluations, SkillGuard reduces attack success from 32.37% to 23.02% for contextual injections and from 25.56% to 16.67% for obvious injections, while maintaining benign task utility. These results suggest that SkillGuard, as a skill-centric permission framework, can provide a practical foundation for improving the privacy and security of agent skill ecosystems.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: