AIDB Daily Papers
LLMスキル記述の真偽:コード実装に潜む未開示のセキュリティ挙動を検出
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- LLMスキルにおける自然言語記述と実行コード実装の間のセキュリティ関連挙動の不一致を調査した。
- この研究は、LLMスキル実装が記述されたセキュリティ範囲を超えていないかを検証する点で重要である。
- 提案手法SKILLSCOPEは、9.4%のスキルに未開示のセキュリティ挙動が存在することを特定し、高い精度と再現率を示した。
Abstract
Programmatic skills in LLM ecosystems consist of a natural-language description and executable implementation files. Users and LLMs rely on the description to understand the skill's scope. However, the implementation may perform security-relevant operations, such as credential access, network communication, or command execution, that the description does not state. We study this description--implementation inconsistency by asking whether the implementation stays within the security-relevant scope declared in the description. We manually analyze 920 real-world programmatic skills and construct an 11-category security property taxonomy. Based on this taxonomy, we build SKILLSCOPE, which constructs source-level security property graphs (SPGs) from implementations and performs LLM-assisted consistency checking. SPG nodes retain source-level code patterns rather than abstract taxonomy labels, preserving fine-grained evidence for checking. On 4,556 programmatic skills with double-blind human review, SKILLSCOPE achieves a precision of 84.8% and a recall of 96.5% for identifying inconsistency. Confirmed inconsistency affects 9.4% of skills, while cases of coarser description, in which implementation details remain within the declared scope, account for 24.3%. Ablation experiments confirm that both the SPG and the taxonomy contribute: removing the taxonomy reduces precision from 87.8% to 72.3%, while removing the SPG reduces recall from 94.7% to 79.0%.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: