AIDB Daily Papers

スキル新語：スキルベースの継続学習に向けて

原題: Skill Neologisms: Towards Skill-based Continual Learning

著者: Antonin Berthon, Nicolas Astorga, Mihaela van der Schaar

公開日: 2026-05-06 | 分野: LLM cs.AI cs.LG 継続学習スキル拡張

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

新しいスキルを既存のLLMにスケーラブルに拡張する手法として、ソフトトークンを用いたスキル新語を提案した。
スキル新語は、重み更新なしで特定のスキル能力を向上させ、既存の能力とも組み合わせ可能である点が重要である。
独立して学習したスキル新語をゼロショットで組み合わせ可能であり、スケーラブルな継続学習の道筋を示唆した。

Abstract

Modern LLMs show mastery over an ever-growing range of skills, as well as the ability to compose them flexibly. However, extending model capabilities to new skills in a scalable manner is an open-problem: fine-tuning and parameter-efficient variants risk catastrophic forgetting, while context-based approaches have limited expressiveness and are constrained by the model's effective context. We explore skill neologisms--i.e., soft tokens integrated in the model's vocabulary and optimized to improve capabilities over a specific skill--as a way to selectively extend model capabilities to new skills without weight updates. We first observe that off-the-shelf pre-trained LLMs already demonstrate tokens associated with procedural knowledge. We then show that skill neologisms can be learned to improve model capabilities on specific skills while being composable with out-of-distribution skills, and that independently trained skill neologisms can be composed zero-shot. These results suggest that skill neologisms may provide a scalable path towards skill-based continual learning.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.04970
カテゴリ: cs.LG, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報