AIDB Daily Papers
自然言語から実行可能なビジュアルワークフローを生成するベンチマーク「Chat2Workflow」
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 自然言語から実行可能なビジュアルワークフローを生成するベンチマーク「Chat2Workflow」を開発した。
- 既存のワークフロー構築は手作業でコストと時間がかかり、エラーも発生しやすいため、LLMによる自動化を目指した。
- 最先端のLLMでも複雑な要求には対応できず、生成されたワークフローの実行エラー修正に課題が残った。
Abstract
At present, executable visual workflows have emerged as a mainstream paradigm in real-world industrial deployments, offering strong reliability and controllability. However, in current practice, such workflows are almost entirely constructed through manual engineering: developers must carefully design workflows, write prompts for each step, and repeatedly revise the logic as requirements evolve-making development costly, time-consuming, and error-prone. To study whether large language models can automate this multi-round interaction process, we introduce Chat2Workflow, a benchmark for generating executable visual workflows directly from natural language, and propose a robust agentic framework to mitigate recurrent execution errors. Chat2Workflow is built from a large collection of real-world business workflows, with each instance designed so that the generated workflow can be transformed and directly deployed to practical workflow platforms such as Dify and Coze. Experimental results show that while state-of-the-art language models can often capture high-level intent, they struggle to generate correct, stable, and executable workflows, especially under complex or changing requirements. Although our agentic framework yields up to 5.34% resolve rate gains, the remaining real-world gap positions Chat2Workflow as a foundation for advancing industrial-grade automation. Code is available at https://github.com/zjunlp/Chat2Workflow.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: