AIDB Daily Papers

自己中心的超知能は協力的になりにくい

原題: Solipsistic Superintelligence is Unlikely to be Cooperative

著者: Rakshit S Trivedi, Natasha Jaques, Logan Cross, Alexander Sasha Vezhnevets, Joel Z Leibo

公開日: 2026-06-02 | 分野: AI cs.AI cs.CY cs.LG AI安全性 AIガバナンス

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

本研究は、自己中心的アプローチで設計された超知能は協力的になりにくいと主張する。
AI研究の主要課題は、能力開発から共存への移行であり、自己最適化は現実世界との乖離を生む。
協調的なAI開発には、動的な評価、制度の組み込み、人間の主体性の維持が不可欠である。

Abstract

AI's central challenge is shifting from capability to coexistence. The dominant paradigm in AI research focuses on developing powerful agents that treat the world as an exogenous and stationary source of feedback. We contend that superintelligence, an extremely capable task solver, born out of such a solipsistic approach to AI design, is unlikely to be cooperative. Deploying AI systems induces endogenous non-stationarity, resulting in a train-test-deploy gap where historical distributions diverge from the deployment context. We refer to this as the self-undermining property of unilateral optimization. Closing this gap requires AI that participates in cooperation: the equilibrium-selection process through which multiple actors navigate their interdependence. We call for a non-solipsistic research paradigm that treats this interdependence as a core design principle rather than approaching cooperation as a task to solve. This entails building dynamic evaluation testbeds involving adaptive counterparties, treating institutions as design primitives, and preserving human agency as a structural feature of the systems we build.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2606.03237
カテゴリ: cs.AI, cs.CL, cs.CY, cs.LG, cs.MA

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報