AIDB Daily Papers

SPOQ: Specialist Orchestrated Queuing for Multi-Agent Software Development

Original title: SPOQ: Specialist Orchestrated Queuing for Multi-Agent Software Engineering

Authors: Royce Carbowitz, Dheeraj Kumar

Published: 2026-06-02 | Fields: Software Engineering, cs.MA, cs.SE, AI Agents, Multi-Agent Systems, AI Assistance

Note: The Japanese title and key points on this page were generated automatically by AI. Refer to the original paper for the exact content.

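Key points

Introduces wave-based topological dispatch, which computes parallel execution waves from the task dependency graph.
Applies dual validation gates, one on the plan before execution and one on the code after it, to reduce rework cycles and improve quality.
Integrates a human as an agent (HaaA) who takes part in task decomposition and can be consulted during execution; a three-tier agent hierarchy optimizes the cost-quality tradeoff.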

Abstract

Multi-agent AI systems show promise for automating software engineering tasks, yet existing approaches suffer from coordination overhead, quality control gaps, and limited human oversight. We introduce SPOQ (Specialist Orchestrated Queuing), a methodology combining three innovations: (1) wave-based topological dispatch that computes parallel execution waves from task dependency graphs; (2) dual validation gates applying quality metrics before execution (planning validation) and after (code validation) to reduce rework cycles; and (3) Human-as-an-Agent (HaaA) integration, where a human specialist participates in decomposition and can be consulted during execution. SPOQ uses a three-tier agent hierarchy (Opus workers, Sonnet reviewers, Haiku investigators) to optimize cost-quality tradeoffs. We evaluate SPOQ through four experiments. Experiment 1: wave dispatch approaches the critical-path lower bound (ratio 1.03–1.11, speedup up to 14.3x); on a 2-slot local backend it delivers a stable 1.4x speedup. Experiment 2: SPOQ improves planning coverage from 93.0 to 99.75, eliminates cyclic plans, and lifts parallelism from 31.0 to 75.25. Experiment 3: dual validation reduces defects from 0.34 to 0.20 per task and lifts test pass rate from 91.25% to 99.75%. Experiment 4: human review reduces residual defects from 0.47 to 0.03 per task. Results are replicated on a locally hosted open-weights model (Qwen3.6-35B-A3B), verifying gains are attributable to orchestration rather than any specific model. A longitudinal study across 17 repositories, 8,589 commits, 1,822 tasks, and 13,866 tests (99.87% pass rate) provides ecological validation.
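The wave dispatch idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a layered variant of Kahn's topological sort, assumes each wave acts as a barrier (a wave finishes when its slowest task does), and assumes unlimited parallel slots. The function names, task names, and durations below are all hypothetical.

```python
def compute_waves(deps):
    """Group tasks into waves so that every prerequisite sits in an earlier wave.

    `deps` maps each task name to the set of tasks it depends on.
    Raises ValueError if the dependency graph contains a cycle.
    """
    # Collect every task mentioned, either as a key or as a prerequisite.
    tasks = set(deps)
    for prereqs in deps.values():
        tasks.update(prereqs)

    remaining = {t: set(deps.get(t, ())) for t in tasks}
    waves = []
    while remaining:
        # A task is ready once all of its prerequisites have been scheduled.
        ready = sorted(t for t, prereqs in remaining.items() if not prereqs)
        if not ready:
            raise ValueError("cyclic plan: no task is ready to run")
        waves.append(ready)
        for t in ready:
            del remaining[t]
        for prereqs in remaining.values():
            prereqs.difference_update(ready)
    return waves


def critical_path(deps, duration):
    """Length of the longest dependency chain, weighted by task duration."""
    finish = {}
    for wave in compute_waves(deps):
        for t in wave:
            start = max((finish[p] for p in deps.get(t, ())), default=0)
            finish[t] = start + duration[t]
    return max(finish.values(), default=0)


def wave_makespan(deps, duration):
    """Total time if each wave runs in parallel and waits for its slowest task."""
    return sum(max(duration[t] for t in wave) for wave in compute_waves(deps))


# Hypothetical plan: task -> prerequisites, and task -> duration in minutes.
deps = {
    "schema": set(),
    "docs": set(),
    "api": {"schema"},
    "ui": {"schema"},
    "tests": {"api", "ui"},
}
duration = {"schema": 2, "docs": 4, "api": 3, "ui": 1, "tests": 2}

print(compute_waves(deps))            # [['docs', 'schema'], ['api', 'ui'], ['tests']]
print(critical_path(deps, duration))  # 7
print(wave_makespan(deps, duration))  # 9
```

In this toy plan the wave makespan is 9 against a critical-path lower bound of 7, a ratio of about 1.29. The ratio reported in Experiment 1 is the same kind of comparison, although how the paper measures it is not specified on this page. The cycle check also shows how a dispatcher of this shape could reject cyclic plans before execution.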

Metadata

arXiv ID: 2606.03115
Categories: cs.SE, cs.MA
