AIDB Daily Papers

LLMエージェントのスキル仕様におけるユーザー理解支援の実現に向けて

原題: Toward User Comprehension Supports for LLM Agent Skill Specifications

著者: Zikai Alex Wen

公開日: 2026-05-19 | 分野: LLM cs.AI cs.HC AIエージェント HCI AI支援

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

LLMエージェントのスキル仕様が、ユーザーの期待形成をどの程度支援するかを調査した。
既存の監査は安全性に焦点を当てるが、本研究は仕様の理解しやすさという補完的な側面に注目した。
仕様に具体的な例や4つの理解指標が不足していると、ユーザーはスキルの機能や入出力を把握しにくいことが判明した。

Abstract

Users often interpret and select agent skills through their SKILL markdown specifications. To protect users, existing audits mainly focus on malicious or unsafe skills. We study the complementary question of whether specifications help users form bounded expectations about what a skill consumes, produces, and covers. Across 878 cybersecurity skills, we used rule-based coding to measure textual cues for four comprehension anchors, namely operational basis, output contract, boundary disclosure, and example capability demonstration. Cues for operational basis were common, but only 19.0% of specifications exhibited cues for an example task, sample, or expected outcome, and only 2.3% exhibited cues for all four anchors. We further examined a small DNS/C2 telemetry subset (n$=$6) to illustrate why missing examples may matter. Examples appeared to make first local checks easier to construct, while no-example skills typically required helper code inspection to recover command arguments or output fields. We argue that agent-skill evaluation should treat specifications as user-facing capability disclosures, not merely as containers for executable instructions.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.19362
カテゴリ: cs.HC, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報