AIDB Daily Papers

意思疎通・予測・行動：エージェントのソーシャルインテリジェンスを評価する

原題: Communicate-Predict-Act: Evaluating Social Intelligence of Agents

著者: David Shoresh, Sarit Kraus, Yonatan Loewenstein

公開日: 2026-04-09 | 分野: マルチエージェントインタラクション戦略大規模言語モデルソーシャルメディア人工知能予測ヒューマンコンピュータインタラクションコミュニケーション LLM 行動社会心理評価エージェント推論ゲーム強化学習

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

LLMエージェントのソーシャルインテリジェンスを、協力・競争的な社会ゲームで評価するフレームワークを導入した。
LLMエージェントの制御可能性を活用し、行動予測、影響力、戦略的推論などの社会認知指標を抽出することが新しい。
影響力、透明性、適応性が、心の理論や深い計画よりもゲームの成功を予測する上で重要であることが判明した。

Abstract

As large language model (LLM) agents become more prevalent in real world social settings, social intelligence will play an increasingly critical role. But social intelligence is still a poorly defined construct, for humans and artificial agents. We introduce a multiplayer arena of mixed cooperative and competitive social games to study LLM social intelligence. The controllability of LLM based agents enables systematic evaluation, which also supports broader inferences about social intelligence per se. We evaluated eight diverse LLMs (24B to 1T parameters) using a Communicate Predict Act (COMPACT) interaction protocol and fine grained probing of social dynamics. Elo style ratings reveal consistent performance differences across models, but this scalar measure provides only a partial characterization of social intelligence. To address this limitation, we analyze gameplay traces to extract sociocognitive metrics capturing action prediction, communicative influence, strategic reasoning, and tradeoffs under conflicting interests. These sociocognitive metrics exhibit strong intramodel consistency and they reliably predict pairwise agent advantage in game outcomes (AUC ROC = 0.82). Feature importance analysis indicates that surprisingly, influence, transparency, and adaptability are more predictive of success than Theory of Mind inference or deep planning. Together, our results advance a testable, multidimensional conception of social intelligence and provide empirical insights into the capacities that underpin it.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.08727
カテゴリ: cs.CY

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報