AIDB Daily Papers

MemRouter：長期対話エージェントのためのメモリ埋め込みルーティング

原題: MemRouter: Memory-as-Embedding Routing for Long-Term Conversational Agents

著者: Tianyu Hu, Weikai Lin, Weizhi Zhang, Jing Ma, Song Wang

公開日: 2026-05-01 | 分野: LLM NLP AI 埋め込み記憶 cs.CL cs.AI 対話システム AIエージェント

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

長期対話エージェントの記憶管理を、埋め込みベースのルーティングポリシーで効率化しました。
従来の逐次的な記憶管理に比べ、応答生成とは独立させることで大幅な高速化を実現しました。
LoCoMoデータセットでの評価では、既存手法より高い精度と大幅な低遅延を達成しました。

Abstract

Long-term conversational agents must decide which turns to store in external memory, yet recent systems rely on autoregressive LLM generation at every turn to make that decision. We present MemRouter, a write-side memory router that decouples memory admission from the downstream answer backbone and replaces per-turn memory-management decoding with an embedding-based routing policy. MemRouter encodes each turn together with recent context, projects the resulting embeddings through a frozen LLM backbone, and predicts whether the turn should be stored using lightweight classification heads while training only 12M parameters. Under a controlled matched-harness comparison on LoCoMo, where the retrieval pipeline, answer prompts, and QA backbone (Qwen2.5-7B) are held identical, MemRouter outperforms an LLM-based memory manager on every question category (overall F1 52.0 vs 45.6, non-overlapping 95% CIs) while reducing memory-management p50 latency from 970ms to 58ms. Descriptive factorial averaging further shows that learned admission improves mean F1 by +10.3 over random storage, category-specific prompting adds +5.2 over a generic prompt, and retrieval contributes +0.7. These results suggest that write-side memory admission can be learned by a small supervised router, while answer generation remains a separate downstream component in long-horizon conversational QA.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.00356
カテゴリ: cs.CL, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報