AIDB Daily Papers
コードの綺麗さはコーディングAIに影響するか?最小ペア研究による検証
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- コードの構造やスタイルといった「綺麗さ」が、AIコーディングエージェントのタスク遂行能力に影響するかを検証した。
- 既存研究ではコードの綺麗さの影響が未解明だったが、本研究ではコードの綺麗さのみを変化させた最小ペアを用いて評価した。
- コードの綺麗さはタスク完了率に影響しなかったが、トークン消費量とファイル再訪率を大幅に削減することが明らかになった。
Abstract
As autonomous coding agents see rapid adoption, their evaluation has primarily focused on task completion rates holding the target codebase fixed. This leaves a critical question unanswered: does the structural and stylistic quality, or ``cleanliness'' of the underlying code affect an agent's ability to navigate and modify it? To isolate the effect of code cleanliness from agent capability, we introduce an evaluation protocol built around minimal pairs: repositories that match on architecture, dependencies, and external behaviour, but differ on static-analysis rule violations and cognitive complexity. The pairs are constructed in both directions, by agent pipelines that either degrade a clean repository or clean a messy one. We author 33 tasks across six such pairs, evaluated through hidden tests at the application's public surface. Across 660 trials with Claude Code, code cleanliness does not change the agent's pass rate. However, it substantially alters the agent's operational footprint: agents working on cleaner code use 7 to 8% fewer tokens and reduce file revisitations by 34%. Our findings suggest that traditional maintainability principles remain highly relevant in the era of AI-driven development, shaping the computational cost and navigational efficiency of coding agents. Code cleanliness joins model choice, harness, and prompting as a factor that materially affects agent behaviours.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: