AIDB Daily Papers
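The Memory Curse: How Expanded Memory Undermines the Cooperative Intent of LLM Agents
Note: The Japanese title and key points were generated automatically by AI. Check the original paper for the accurate content.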
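Key points
- The authors identify a "memory curse" in LLM agents: expanding memory (the accessible history in the context window) degrades cooperative behavior.
- The effect is associated with eroding forward-looking cooperative intent, and a length-controlled experiment shows that the trigger is the content of memory, not prompt length alone.
- Explicit Chain-of-Thought reasoning can amplify the memory curse, which makes memory an active determinant of multi-agent behavior.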
Abstract
Context window expansion is often treated as a straightforward capability upgrade for LLMs, but we find it systematically fails in multi-agent social dilemmas. Across 7 LLMs and 4 games over 500 rounds, expanding accessible history degrades cooperation in 18 of 28 model-game settings, a pattern we term the memory curse. We isolate the underlying mechanism through three analyses. First, lexical analysis of 378,000 reasoning traces associates this breakdown with eroding forward-looking intent rather than rising paranoia. We validate this using targeted fine-tuning as a cognitive probe: a LoRA adapter trained exclusively on forward-looking traces mitigates the decay and transfers zero-shot to distinct games. Second, memory sanitization holds prompt length fixed while replacing visible history with synthetic cooperative records, which restores cooperation substantially, proving the trigger is memory content, not length alone. Finally, ablating explicit Chain-of-Thought reasoning often reduces the collapse, showing that deliberation paradoxically amplifies the memory curse. Together, these results recast memory as an active determinant of multi-agent behavior: longer recall can either destabilize or support cooperation depending on the reasoning patterns it elicits.
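The memory sanitization idea in the abstract can be illustrated with a minimal sketch. Everything here is an assumption for illustration, not the paper's implementation: the two-action game with "C"/"D" labels, the helper names `visible_history`, `sanitize`, and `render`, the prompt layout, and the use of character count as a stand-in for prompt length.

```python
from typing import List, Tuple

# One round of a hypothetical two-player game: (own_action, partner_action),
# where "C" means cooperate and "D" means defect.
Round = Tuple[str, str]


def visible_history(history: List[Round], window: int) -> List[Round]:
    """Return the most recent `window` rounds the agent is allowed to see."""
    # Guard against window == 0, because history[-0:] would return everything.
    return history[-window:] if window > 0 else []


def sanitize(history: List[Round]) -> List[Round]:
    """Replace every visible round with a synthetic mutual-cooperation record.

    The number of rounds is unchanged, so the rendered prompt keeps its length.
    """
    return [("C", "C") for _ in history]


def render(history: List[Round]) -> str:
    """Format the history as prompt text (hypothetical layout)."""
    return "\n".join(
        f"Round {i}: you={own}, partner={other}"
        for i, (own, other) in enumerate(history, start=1)
    )


if __name__ == "__main__":
    full = [("C", "D"), ("D", "D"), ("C", "C"), ("D", "C")]
    seen = visible_history(full, window=3)
    clean = sanitize(seen)
    # Same number of characters, different content.
    print(len(render(seen)) == len(render(clean)))
    print(render(clean))
```

Because every action label is a single character, the sanitized prompt has exactly the same length as the original, so any change in an agent's behavior between the two conditions would be attributable to what the history says and not to how long it is.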