AIDB Daily Papers

プランナーが鍵！長期間タスク計画のための効率的・不均衡型マルチエージェント協調フレームワーク

原題: Planner Matters! An Efficient and Unbalanced Multi-agent Collaboration Framework for Long-horizon Planning

著者: Wenyi Wu, Sibo Zhu, Kun Zhou, Biwei Huang

公開日: 2026-05-04 | 分野: 強化学習 AI マルチエージェント VLM cs.AI cs.MA cs.LG AIエージェント長期間計画

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

長期間タスク計画の課題に対し、計画・実行・記憶管理の3役割に分解するマルチエージェントフレームワークを提案した。
計算資源配分分析から、計画段階がタスク性能に最も影響し、実行・記憶管理は少ない計算資源で十分であることを明らかにした。
計画に特化した強化学習とVLM評価により、計算効率と性能を両立させ、長期間エージェント自動化を大幅に改善した。

Abstract

Language model (LM)-based agents have demonstrated promising capabilities in automating complex tasks from natural language instructions, yet they continue to struggle with long-horizon planning and reasoning. To address this, we propose an enhanced multi-agent framework that decomposes automation into three roles: a planner for high-level decision-making, an actor for task execution, and a memory manager for contextual reasoning. While this modular decomposition aligns with established design patterns, our core contribution lies in a systematic compute-allocation analysis, revealing that planning is the dominant factor influencing task performance. Execution and memory management require significantly less compute and model capacity to achieve competitive results. Building on these insights, we introduce a planner-centric reinforcement learning approach, which exclusively optimizes the planner using trajectory-level rewards from a VLM-as-judge, while freezing the other components. Extensive experiments on benchmarks spanning web navigation, OS control, and tool use demonstrate that concentrating model capacity and learning on high-level planning yields robust and compute-efficient improvements in long-horizon agent automation. Our code is publicly released.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.02168
カテゴリ: cs.AI, cs.LG, cs.MA

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報