AIDB Daily Papers
MuChator:対話型音楽LLMで抖音(Douyin)音楽の能動的な音楽発見を可能にする
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 抖音音楽プラットフォームにおける受動的な音楽発見の限界を克服するため、能動的な音楽発見を可能にする対話型LLMフレームワークMuChatorを開発した。
- MuChatorは、音楽知識の事前学習、文脈を考慮した指示チューニング、ハイブリッドRMによる嗜好性アライメントの3つの主要コンポーネントにより、ユーザーの曖昧な音楽意図を理解する。
- 本研究により、抖音音楽アプリでの導入でユーザーのアクティブ利用日数が46.49%向上し、既存モデルを凌駕する結果となった。
Abstract
Douyin Music, a large-scale platform with millions of daily users, adopts an immersive, feed-based discovery paradigm, where users passively explore music through continuous recommendations. While effective for passive music discovery, this paradigm restricts users to recommendation results and provides limited support for explicitly specifying listening intents. Unlike conventional search, where users express well-defined intents through explicit queries such as specific songs or artists, real-world active music discovery is often situational and colloquial, involving vague or underspecified requests. While LLMs enable natural language interaction, their direct use in music discovery remains limited by insufficient music-domain knowledge, lack of music-query collaborative reasoning, and shallow understanding of personalized preferences. To address these challenges, we introduce MuChator, an interactive MusicLLM-based framework that enables users to actively express situational music intents in natural language. MuChator incorporates three key components: (1) Music Knowledge Pre-training, a three-stage scheme that incrementally injects objective music knowledge, subjective music knowledge, and personalized music preferences into LLMs; (2) Context-aware Instruction Tuning, which constructs high-quality user-query-music triplets through an automated synthesis pipeline to align LLMs with active and situational user intents; and (3) Preference Alignment with Hybrid RM, which jointly models intent relevance, personalized preferences, and basic constraints, and is optimized using GRPO-based reinforcement learning. Extensive evaluations on industrial music recommendation datasets demonstrate that MuChator outperforms leading proprietary models, such as Gemini-3-Pro. The model has been deployed on Douyin Music App within ByteDance, with 46.49% improvement of user active days in online A/B test.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: