AIDB Daily Papers

LLM生成コードの可読性スペクトル：パターン、問題点、プロンプト効果の解明

原題: The Readability Spectrum: Patterns, Issues, and Prompt Effects in LLM-Generated Code

著者: Hengzhi Ye, Fengyuan Ran, Weiwei Xu, Minghui Zhou

公開日: 2026-05-13 | 分野: LLM 可読性ソフトウェアエンジニアリングコード生成 cs.AI cs.SE プロンプトエンジニアリング

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

LLM生成コードの可読性を評価するため、テキスト、構造、プログラム、視覚的特徴を統合した包括的なモデルを構築した。
現在のLLMは人間が書いたコードと同等の可読性を持つが、特有の可読性問題パターンを示すことが判明した。
関数シグネチャ、制約、スタイル記述が可読性に最も影響するが、プロンプト全体の効果は限定的であった。

Abstract

As Large Language Models (LLMs) are transforming software development, the functional quality of generated code has become a central focus, leaving readability, one of critical non-functional attributes, understudied. Given that LLM-generated code still needs human review before adoption, it is important to understand its readability especially compared with human-written code and the role of prompt design in shaping it. We therefore set out to conduct a systematic investigation into the code readability of LLM-generated code. To systematically quantify code readability, We establish a comprehensive readability model that synthesizes textual, structural, program, and visual features of code. Based on the model, we evaluate the readability of code generated by the mainstream LLMs under 5,869 scenarios extracted from large code base including World of Code (WoC) and LeetCode. We find that current LLMs produce code with overall readability comparable to human-written code, but displaying distinct readability issue patterns. We further examine how different prompt dimensions affect the readability of LLM-generated code, and find that function signatures, constraints and style descriptions emerge as the most influential factors, while the overall impact of prompt design remains limited. Our findings indicate that, on one hand, LLM-generated code is at least comparable to human-written code in readability, validating its potential for systematic integration into software workflows from a non-functional perspective; on the other hand, distinct readability issue patterns and limited effectiveness of prompt engineering reveal a latent technical debt, highlighting the need for future research to improve the readability of LLM-generated code and thus ensure long-term maintainability.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.13280
カテゴリ: cs.SE, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報