AIDB Daily Papers

AIに「睡眠」を！記憶を定着させ自己進化する新パラダイム

原題: Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories

著者: Ali Behrouz, Farnoosh Hashemi, Vahab Mirrokni

公開日: 2026-06-02 | 分野: 機械学習 AI 深層学習 cs.AI cs.LG 継続学習

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

AIモデルが継続的に学習し、短期記憶を長期知識へ定着させる「睡眠」パラダイムを提案した。
人間の学習プロセスに着想を得て、記憶の定着と自己改善を可能にする点が重要である。
実験により、長期学習や知識の取り込み、少数ショット汎化タスクにおける「睡眠」の有効性が示された。

Abstract

The past few decades have witnessed significant advances in the design of machine learning algorithms, from early studies on task-specific shallow models to more general deep Large Language Models (LLMs). Despite showing promising results in tasks that require instant prediction or in-context learning, existing models lack the ability to continually learn and effectively transfer their temporal in-context knowledge to their long-term parameters. Inspired by human learning process, we introduce a ''Sleep'' paradigm that allows the models to continually learn, distill their short-term fragile memories into stable long-term knowledge with replay, and recursively improve themselves with ''Dreaming'' process. In more detail, sleep consists of two stages: (1) Memory Consolidation: an upward distillation process, called Knowledge Seeding, where the memories of a smaller-self are distilled into a larger network to provide more capacity while preserving the knowledge. As a proof of concept, we present a new Generalized Distillation process for {Knowledge Seeding} (i.e., the combination of on-policy distillation with Reinforcement Learning (RL)-based imitation learning); (2) Dreaming: a self-improvement phase, where the model uses RL to generate a curriculum of synthetic data to rehearse new knowledge and refine existing capabilities without human supervision. Our experiments on long-horizon, continual learning, knowledge incorporation, and few-shot generalization tasks support the importance of the sleep stage.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2606.03979
カテゴリ: cs.LG, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報