AIDB Daily Papers
SMILE-Next:大規模言語モデルに笑いの検出・分類・推論を教える
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 現実世界での笑いの理解を深めるため、笑いの検出・分類・推論を行うデータセットとモデルを開発しました。
- 笑いに特化した自己指示生成と専門家ルーティング機構により、タスク適応性と効率性を向上させました。
- 提案手法は既存のマルチモーダルLLMを大幅に上回り、現実世界の笑いの理解を促進するものです。
Abstract
Laughter is a complex social signal that conveys communicative intent beyond amusement. While prior work has focused on isolated laughter analysis tasks, a comprehensive understanding of laughter in real-world scenarios remains underexplored. Therefore, we introduce SMILE-Next, a dataset for real-world laughter understanding with multimodal textual representations and question-answer annotations across three tasks: laughter detection, laughter type classification, and laughter reasoning. Building upon SMILE-Next, we aim to develop a laughter-specialized large language model capable of nuanced understanding of laughter in real-world contexts. To this end, we propose two key components: laughter-specific Self-Instruct and the Mixture-of-Laugh-Experts (MoLE) framework. Laughter-specific Self-Instruct enhances generalization across tasks and domains by automatically synthesizing diverse laughter-centric instructions. MoLE introduces a task-adaptive expert routing mechanism that dynamically selects specialized experts tailored to each laughter-related task, improving task-specific performance and efficiency. Experimental results show that the combination of our proposed components substantially outperforms multimodal LLM baselines, advancing robust real-world laughter understanding. Project page is at: https://mok0102.github.io/smile-next/.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: