AIDB Daily Papers
MIND-Skill: 多様なAIエージェントによる誘導と演繹を用いた高品質なスキル自動生成
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 成功した実行軌跡から汎用的なスキルを自動で生成するMIND-Skillフレームワークを提案した。
- 人間の専門家による手作業を不要にし、複雑な実世界タスクにおけるAIエージェントの能力向上を目指す点で重要である。
- 再構成損失、結果損失、ルーブリック損失を組み合わせることで、高品質かつ汎用的なスキル生成を実現した。
Abstract
Large language model (LLM) powered AI agents have emerged as a promising paradigm for autonomous problem-solving, yet they continue to struggle with complex, multi-step real-world tasks that demand domain-specific procedural knowledge. Reusable agent skills, which encapsulate successful problem-solving strategies, offer a natural remedy by enabling agents to build on prior experience. However, curating such skills has largely remained a manual endeavor, requiring human experts to distill rich domain knowledge into actionable guidelines. In this work, we present $textbf{M}$ulti-agent $textbf{IN}$duction and $textbf{D}$eduction for $textbf{Skill}$s ($textbf{MIND-Skill}$), a framework that automatically induces generalizable skills from successful trajectories with robust quality guarantees. MIND-Skill consists of an induction agent which is tasked to abstract reusable skills from successful trajectories, and a deduction agent which aims to reconstruct trajectories by following the induced skills. To guarantee the quality of the generated skills, we introduce a reconstruction loss that compares input and reconstructed trajectories, an outcome loss that enforces the correctness of the reconstructed trajectories, and a rubric loss that assesses the documentation quality and regularizes the abstraction level of the generated skills according to predefined criteria. These textual losses are jointly optimized with TextGrad, and the resulting skills are evaluated on held-out tasks unseen during optimization. Experiments on AppWorld and BFCL-v3 show that MIND-Skill consistently outperforms concurrent skill generation methods.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: