AIDB Daily Papers
PersonaVLM: A Multimodal LLM for Long-Term Personalization
Note: The title and key points below were auto-generated by AI. Please refer to the original paper for accurate details.
Key Points
- The authors developed PersonaVLM, which adapts a multimodal LLM to individual user preferences with the aim of long-term personalization.
- Its key innovation is that it remembers and reasons over information from past interactions to generate responses tailored to the user's personality.
- On the Persona-MME benchmark it improves over the baseline by 22.4% and also outperforms GPT-4o.
Abstract
Multimodal Large Language Models (MLLMs) serve as daily assistants for millions. However, their ability to generate responses aligned with individual preferences remains limited. Prior approaches enable only static, single-turn personalization through input augmentation or output alignment, and thus fail to capture users' evolving preferences and personality over time (see Fig. 1). In this paper, we introduce PersonaVLM, an innovative personalized multimodal agent framework designed for long-term personalization. It transforms a general-purpose MLLM into a personalized assistant by integrating three key capabilities: (a) Remembering: It proactively extracts and summarizes chronological multimodal memories from interactions, consolidating them into a personalized database. (b) Reasoning: It conducts multi-turn reasoning by retrieving and integrating relevant memories from the database. (c) Response Alignment: It infers the user's evolving personality throughout long-term interactions to ensure outputs remain aligned with their unique characteristics. For evaluation, we establish Persona-MME, a comprehensive benchmark comprising over 2,000 curated interaction cases, designed to assess long-term MLLM personalization across seven key aspects and 14 fine-grained tasks. Extensive experiments validate our method's effectiveness, improving the baseline by 22.4% (Persona-MME) and 9.8% (PERSONAMEM) under a 128k context, while outperforming GPT-4o by 5.2% and 2.0%, respectively. Project page: https://PersonaVLM.github.io.
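The abstract outlines a three-stage agent loop: consolidating interaction summaries into a personalized database, retrieving relevant memories at query time, and aligning generation with an inferred user profile. As a rough illustration only, here is a minimal Python sketch of such a loop. It is not the paper's implementation: every name below (PersonalizedAgent, Memory, remember/reason/respond) is a hypothetical stand-in, and the keyword-overlap retrieval is a placeholder for whatever retrieval mechanism PersonaVLM actually uses.

```python
from dataclasses import dataclass, field
from typing import Callable, List

@dataclass
class Memory:
    """One consolidated record extracted from a past interaction."""
    timestamp: int
    summary: str  # text summary of a (possibly multimodal) exchange

@dataclass
class PersonaProfile:
    """Evolving picture of the user, updated over long-term interactions."""
    traits: List[str] = field(default_factory=list)

class PersonalizedAgent:
    """Hypothetical wrapper turning a generic model callable into a
    remember/reason/align loop, loosely mirroring the abstract's (a)-(c)."""

    def __init__(self, mllm: Callable[[str], str]):
        self.mllm = mllm                  # any general-purpose model callable
        self.database: List[Memory] = []  # chronological personalized database
        self.profile = PersonaProfile()

    # (a) Remembering: summarize the interaction and store it chronologically.
    def remember(self, turn_id: int, interaction: str) -> None:
        summary = self.mllm(f"Summarize for long-term memory: {interaction}")
        self.database.append(Memory(timestamp=turn_id, summary=summary))

    # (b) Reasoning: retrieve the memories most relevant to the current query.
    # Keyword overlap stands in for a real (e.g. embedding-based) retriever.
    def reason(self, query: str, top_k: int = 3) -> List[Memory]:
        words = set(query.lower().split())
        scored = sorted(
            self.database,
            key=lambda m: len(words & set(m.summary.lower().split())),
            reverse=True,
        )
        return scored[:top_k]

    # (c) Response alignment: condition generation on retrieved memories and
    # the evolving persona profile, then memorize the new exchange.
    def respond(self, turn_id: int, query: str) -> str:
        context = "\n".join(m.summary for m in self.reason(query))
        prompt = (
            f"User traits: {', '.join(self.profile.traits) or 'unknown'}\n"
            f"Relevant memories:\n{context}\n"
            f"User: {query}"
        )
        answer = self.mllm(prompt)
        self.remember(turn_id, f"Q: {query} A: {answer}")
        return answer

if __name__ == "__main__":
    # Stub model so the sketch runs end to end without any real MLLM.
    echo = lambda prompt: f"[model output for: {prompt[:40]}...]"
    agent = PersonalizedAgent(mllm=echo)
    print(agent.respond(0, "Recommend a weekend trip"))
```

The retrieval and profile handling here are deliberately naive placeholders; per the abstract, the actual system works over multimodal memories and infers the user's evolving personality across long-term interactions.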