AIDB Daily Papers
SoftSkill:文脈適応のための行動圧縮
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 自然言語で記述されたエージェントのスキルを、学習可能なコンパクトな連続コンテキストオブジェクトとして初期化する手法を提案した。
- この手法は、大規模言語モデルの推論時に、従来のMarkdown形式のスキル記述よりも効率的に振る舞いを初期化できる点で重要である。
- SoftSkillは、Qwen3.5-4Bモデルにおいて、検索QAやLiveMathなどのタスクで大幅な精度向上を達成し、スキル記述を仮想トークンに置き換えた。
Abstract
Agent skills are commonly deployed as natural-language Markdown files that encode answer policies, evidence-use habits, and task procedures. These files are readable and portable, but they are consumed indirectly: for each task instance, a frozen language model must translate a long textual artifact into generation-time behavior. This paper asks whether a natural-language skill can instead initialize a compact continuous context object, refined by a trainable soft delta while the base model remains frozen. We propose SoftSkill, a frozen-backbone method that tunes such soft skills with next-token prediction and deploys them as latent behavioral priors at inference time. In our main single-round setting, a length-32 SoftSkill prefix on Qwen3.5-4B improves over no-skill prompting by 8.3 points on SearchQA, 42.1 points on LiveMath, and 1.3 points on DocVQA. Relative to SkillOpt, SoftSkill improves accuracy by 5.2 points on SearchQA and 12.5 points on LiveMath, while replacing hundreds to thousands of Markdown skill tokens with a few virtual tokens. We further study agentic execution as a harder boundary case, where sparse trajectory imitation provides useful signal but does not yet robustly compress long-horizon procedural behavior. More broadly, the results suggest that some task skills are better treated not as additional Markdown to be reinterpreted at inference time, but as compact latent controls over how a frozen model enters the task.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: