AIDB Daily Papers
MindClaw:精密介入のための閉ループ型身体的メンタルステート推論
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- ロボットが他者の信念や意図を理解し、適切なタイミングで介入する閉ループ型の身体的メンタルステート推論フレームワークを構築した。
- 従来の評価手法では不十分だった、環境変化への追従、介入の必要性判断、不要な介入の抑制といった能力を検証する。
- MindClawは、直接的なVLMベースラインと比較して、タスク認識と介入キャリブレーションにおいて優れた性能を示した。
Abstract
Theory of Mind (ToM) enables an agent to reason about another actor's beliefs, goals, and intentions, which is essential for human-centered embodied assistance. Existing ToM benchmarks have advanced text and multimodal mental-state recognition, but they mostly evaluate offline question answering or final action prediction. They do not fully test whether an embodied agent can stay connected to a changing environment, update actor-specific beliefs, decide when reasoning is needed, and intervene only when help is useful. Building on MindPower, we extend robot-centric ToM reasoning to a real-time closed-loop setting and introduce MindClaw, a framework for embodied mental-state reasoning with precision intervention. MindClaw connects multi-source inputs, belief memory, an embodied cognitive trigger skill, mental reasoning, and action generation, allowing the agent to output helpful actions at the right time while remaining silent when intervention is unnecessary. Experiments show that direct VLM baselines struggle with task awareness and intervention calibration, while MindClaw achieves the best overall performance, demonstrating the importance of trigger-skill optimization for closed-loop embodied ToM assistance.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: