AIDB Daily Papers
MOSAIC:マルチエージェント強化学習、LLM、VLM、および人間の意思決定者を統合的に評価・比較する統一プラットフォーム
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 異なる意思決定パラダイムのエージェントを同一環境で比較評価できるオープンソースプラットフォームMOSAICを開発した。
- 強化学習、LLM、VLM、人間など異種エージェントが協調し、再現性のある実験を可能にする点が重要かつ新しい。
- IPCベースのプロトコル、オペレータ抽象化、決定論的な評価フレームワークを提供し、クロスパラダイム研究を促進する。
Abstract
Reinforcement learning (RL), large language models (LLMs), and vision-language models (VLMs) have been widely studied in isolation. However, existing infrastructure lacks the ability to deploy agents from different decision-making paradigms within the same environment, making it difficult to study them in hybrid multi-agent settings or to compare their behaviour fairly under identical conditions. We present MOSAIC, an open-source platform that bridges this gap by incorporating a diverse set of existing reinforcement learning environments and enabling heterogeneous agents (RL policies, LLMs, VLMs, and human players) to operate within them in ad-hoc team settings with reproducible results. MOSAIC introduces three contributions. (i) An IPC-based worker protocol that wraps both native and third-party frameworks as isolated subprocess workers, each executing its native training and inference logic unmodified, communicating through a versioned inter-process protocol. (ii) An operator abstraction that forms an agent-level interface by mapping workers to agents: each operator, regardless of whether it is backed by an RL policy, an LLM, or a human, conforms to a minimal unified interface. (iii) A deterministic cross-paradigm evaluation framework offering two complementary modes: a manual mode that advances up to N concurrent operators in lock-step under shared seeds for fine-grained visual inspection of behavioural differences, and a script mode that drives automated, long-running evaluation through declarative Python scripts, for reproducible experiments. We release MOSAIC as an open, visual-first platform to facilitate reproducible cross-paradigm research across the RL, LLM, and human-in-the-loop communities.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: