AIDB Daily Papers
Text2Score:テキスト指示から楽譜を生成するAI
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- テキスト指示から楽譜を生成する「Text2Score」を開発し、テキストと楽譜のペアが少ない課題を克服した。
- LLMが楽譜の構造計画を立て、それを基にABC記譜法で楽譜を生成する二段階フレームワークを提案した点が新しい。
- 専門家による評価で、既存手法を上回る演奏性、可読性、指示への忠実性を確認した。
Abstract
Developing text-driven symbolic music generation models remains challenging due to the scarcity of aligned text-music datasets and the unreliability of automated captioning pipelines. While most efforts have focused on MIDI, sheet music representations are largely underexplored in text-driven generation. We present Text2Score, a two-stage framework comprising a planning stage and an execution stage for generating sheet music from natural language prompts. By deriving supervision signals directly from symbolic XML data, we propose an alternative training paradigm that bypasses noisy or scarce text-music pairs. In the planning stage, an LLM orchestrator translates a natural language prompt into a structured measure-wise plan defining musical attributes such as instruments, key, time signatures, harmony, etc. This plan is then consumed by a generative model in the execution stage to produce interleaved ABC notation conditioned on the plan's structural constraints. To assess output quality, we introduce an evaluation framework covering playability, readability, instrument utilization, structural complexity, and prompt adherence, validated by expert musicians. Text2Score consistently outperforms both a pure LLM-based agentic framework and three end-to-end baselines across objective and subjective dimensions. We open-source the dataset, code, evaluation set and LLM prompts used in this work; a demo is available on our project page (https://keshavbhandari.github.io/portfolio/text2score).
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: