AIDB Daily Papers
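MAS-Algorithm: A Multi-Agent System Workflow for Solving Algorithmic Programming Problems
Note: The title and key points shown here were generated automatically by AI (originally in Japanese). Please check the original paper for the exact content.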
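Key Points
- Proposes MAS-Algorithm, a multi-agent workflow specialized for algorithmic problem solving, as a way to evaluate the structured reasoning ability of AI coding systems.
- Addresses the limitations of existing approaches by combining modular stages, inspired by the practices of competitive programmers, with flexible coordination among agents, enabling structured reasoning and tool integration.
- MAS-Algorithm achieves an average acceptance-rate gain of 6.48% on Qwen models and shows that individual agents can contribute improvements of up to 27.7%.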
Abstract
Algorithmic problem solving serves as a rigorous testbed for evaluating structured reasoning in AI coding systems, as it directly reflects a model's ability to perform structured reasoning in complex scenarios. Existing approaches predominantly rely on model-centric strategies, such as architectural modifications and data scaling, which are costly and offer limited interpretability. Alternative methods leveraging external tools or prompting techniques (e.g., chain-of-thought) are often fragmented and lack a unified framework. In this paper, we propose MAS-Algorithm, a systematic multi-agent workflow for algorithmic problem solving inspired by the practices of competitive programmers and algorithm engineers. Our framework decomposes the end-to-end solving process into modular stages, enabling structured reasoning, tool integration, and flexible coordination among agents. The design emphasizes both rigor and extensibility, allowing it to generalize across diverse problem types. Experimental results on a self-constructed benchmark demonstrate consistent improvements across multiple Qwen series models, achieving an average gain of 6.48% in acceptance rate. In contrast, parameter-efficient fine-tuning on the same data yields only a marginal improvement of 0.89%. We further observe a 4.72% gain on LiveCodeBench-Pro, along with consistent improvements across additional accuracy and efficiency metrics. Beyond performance gains, we conduct comprehensive analyses to better understand the reasoning process within the workflow, including error patterns and cross-scenario behaviors. We further perform customized replacement and ablation studies to explore the upper bound of the framework, showing that individual agents can contribute improvements of up to 27.7%. These results highlight the strong potential of MAS-Algorithm for advancing AI-driven algorithmic reasoning.