AIDB Daily Papers

AIエージェントのための自動化設計：究極のハーネスエンジニアリング

原題: The Last Harness You'll Ever Build

著者: Haebin Seong, Li Yin, Haoran Zhang

公開日: 2026-04-22 | 分野: ロボティクス機械学習 AI エージェント自動化 cs.AI プロンプトエンジニアリング

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

AIエージェントが複雑な業務を遂行するために必要なプロンプトやツールなどの「ハーネス」を自動設計するフレームワークを提案した。
従来、タスクごとに専門家が手作業で行っていたハーネスエンジニアリングを自動化し、AIによる自動化設計そのものを学習させる点が重要である。
提案手法により、AIエージェントは新たな業務ドメインへの適応が迅速化し、人間によるハーネス設計が不要となることが示された。

Abstract

AI agents are increasingly deployed on complex, domain-specific workflows -- navigating enterprise web applications that require dozens of clicks and form fills, orchestrating multi-step research pipelines that span search, extraction, and synthesis, automating code review across unfamiliar repositories, and handling customer escalations that demand nuanced domain knowledge. textbf{Each new task domain requires painstaking, expert-driven harness engineering}: designing the prompts, tools, orchestration logic, and evaluation criteria that make a foundation model effective. We present a two-level framework that automates this process. At the first level, the textbf{Harness Evolution Loop} optimizes a worker agent's harness $mathcal{H}$ for a single task: a Worker Agent $W_{mathcal{H}}$ executes the task, an Evaluator Agent $V$ adversarially diagnoses failures and scores performance, and an Evolution Agent $E$ modifies the harness based on the full history of prior attempts. At the second level, the textbf{Meta-Evolution Loop} optimizes the evolution protocol $Λ= (W_{mathcal{H}}, mathcal{H}^{(0)}, V, E)$ itself across diverse tasks, textbf{learning a protocol $Λ^{(text{best})}$ that enables rapid harness convergence on any new task -- so that adapting an agent to a novel domain requires no human harness engineering at all.} We formalize the correspondence to meta-learning and present both algorithms. The framework textbf{shifts manual harness engineering into automated harness engineering}, and takes one step further -- textbf{automating the design of the automation itself}.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.21003
カテゴリ: cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報