AIDB Daily Papers
AIネイティブ開発を加速するメタエンジニアリング:契約駆動型敵対的検証アーキテクチャ
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 本研究では、AIネイティブソフトウェアの継続的な開発・検証・デプロイ・保守を実現するメタエンジニアリングアーキテクチャを提案した。
- 従来のモデル単位の評価では不十分な本番環境に対応するため、要件を契約に変換し、AIエージェントによる検証と自己改善を行う点が重要である。
- 早期の実運用デプロイメントでは、契約の不備や検証境界の問題が明らかになり、アーキテクチャの改善に直接つながった。
Abstract
AI-native software development is often evaluated at the level of individual models, prompts, or generated artifacts. This framing is insufficient for production environments where software must be continuously produced, verified, deployed, maintained, and adapted across many operational contexts and long time horizons. We present a meta-engineering harness: a software-production architecture that transforms operational and product feature requirements into explicit contracts, routes work through role-specialized AI agents, performs independent and adversarial verification, and continuously improves itself through structured failure classification and outer-loop calibration. The harness is designed for settings in which software delivery is not a one-time project but an ongoing operating function. In our motivating application, CTO-as-a-service for small service firms, the system manages websites, booking flows, payment systems, backoffice workflow automations, and AI-agent interfaces as continuously evolving technical infrastructure rather than one-off deliverables. We describe the layered architecture, including two-pass contract compilation, persistent markdown memory with specialization records, attention-based and independence-based verifications, a four-way failure arbiter, and outer-loop calibration. We report results from an early production deployment spanning 17 features over several weeks, including a detailed in-app payments case study that revealed contract incompleteness and verification-boundary issues. These observations directly drove targeted improvements to the harness. The contribution is an implemented, measurable, and extensible verification architecture for making AI-native service-as-a-software production reliable, auditable, and improvable over time.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: