AIDB Daily Papers

要求を考慮したカリキュラム強化学習によるLLMコード生成の改善

原題: Improving LLM Code Generation via Requirement-Aware Curriculum Reinforcement Learning

著者: Shouyu Yin, Zhao Tian, Junjie Chen, Shikai Guo

公開日: 2026-05-01 | 分野: LLM 強化学習 AI コード生成 cs.AI cs.SE 要求工学

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

本研究は、LLMによるコード生成の性能限界を克服するため、要求を考慮したカリキュラム強化学習フレームワークRECRLを提案した。
既存手法の課題であった要求の難易度認識のずれや最適化の欠如を解消し、学習データの利用効率を高めることが重要である。
RECRLは、モデル固有の要求難易度を自動認識し、難易度を最適化することで、平均Pass@1精度を1.23%-5.62%向上させた。

Abstract

Code generation, which aims to automatically generate source code from given programming requirements, has the potential to substantially improve software development efficiency. With the rapid advancement of large language models (LLMs), LLM-based code generation has attracted widespread attention from both academia and industry. However, as programming requirements become increasingly complex, existing LLMs still exhibit notable performance limitations. To address this challenge, recent studies have proposed training-based curriculum reinforcement learning (CRL) strategies to improve LLM code generation performance. Despite their effectiveness, existing CRL approaches suffer from several limitations, including misaligned requirement difficulty perception, the absence of requirement difficulty optimization, and suboptimal curriculum sampling strategies. In CRL-based code generation, programming requirements serve as the sole input to the model, making their quality and difficulty critical to training effectiveness. Motivated by insights from software requirements engineering, we propose RECRL, a novel requirement-aware curriculum reinforcement learning framework for enhancing LLM-based code generation. RECRL automatically perceives model-specific requirement difficulty, optimizes challenging requirements to improve training data utilization, and employs an adaptive curriculum sampling strategy to construct training batches with smoothly varying difficulty. Extensive experiments on five state-of-the-art LLMs across five widely-used code generation benchmarks by comparing with five state-of-the-art baselines, demonstrate the significant effectiveness of RECRL. For example, RECRL achieves an average Pass@1 improvement of 1.23%-5.62% over all state-of-the-art baselines.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.00433
カテゴリ: cs.SE, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報