AIDB Daily Papers
CalBench:マルチエージェントLLMにおける協調とプライバシーのトレードオフ評価
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- カレンダー調整タスクを通じてマルチエージェントの協調を評価する環境CalBenchを開発した。
- プライベートな情報を持つエージェント間の効果的なコミュニケーションと、タスク遂行におけるプライバシー保護の重要性を検証する。
- 協調の質、コミュニケーション効率、プライバシー漏洩を正確に測定し、分散型協調プロトコルの研究に貢献する。
Abstract
We introduce CalBench, a controlled evaluation environment for studying multi-agent coordination through calendar scheduling. In CalBench, N agents each manage a private calendar containing pre-existing commitments and must coordinate to schedule a stream of M incoming meetings while minimizing disruption costs. Because agents observe only their own calendars, successful scheduling requires communication across private information boundaries. Each scenario is generated with an oracle solution, enabling precise measurement of coordination quality via realized-to-optimal cost, as well as a Distributed Constraint Optimization (DCOP) baseline to provide a fair comparison under the same private-information constraints. CalBench enables precise verification of task success, communication efficiency, and fairness in the distribution of disruption costs. Our environment also studies privacy-preserving coordination by augmenting calendar entries with private semantic contexts of varying sensitivity and measuring whether agents reveal task-irrelevant private information during negotiation. Unlike multi-agent benchmarks where a single capable agent can often substitute for the group, CalBench is inherently decentralized: no agent has access to another agent's private calendar, yet agents must still reach mutually consistent decisions over shared meeting scheduling. CalBench therefore provides a practical and verifiable setting for studying coordination protocols, communication efficiency, negotiation strategies, fairness, and privacy leakage in multi-agent systems.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: