AIDB Daily Papers
LegalWorld:法律エージェントのためのライフサイクル対話型環境
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 中国の民事訴訟を5段階のライフサイクルとしてモデル化した対話型環境LegalWorldを構築した。
- 既存のベンチマークが個別のタスクしか評価しないのに対し、本研究は段階間の因果関係を考慮した。
- 構築した環境とLongJud-Benchにより、エージェントの全ライフサイクルにわたる能力を評価した。
Abstract
Civil litigation is inherently a life-cycle process: what a lawyer drafts on day one constrains what unfolds at trial months later. Yet existing legal benchmarks evaluate isolated subtasks, and prior legal-agent simulators reinitialize each scenario from shared ground truth, leaving cross-stage causal dependencies unmodeled. We present LegalWorld, a life-cycle interactive environment that models Chinese civil litigation as a causally connected state chain of five stages (seven sub-scenarios), grounded in 75,309 paired Chinese civil judgments. We pair it with reusable infrastructure (local memory, global case memory, a Skill/Tool library) that keeps each dispute consistent across its full life cycle. Building on this environment, we construct LongJud-Bench to evaluate agent capability across all five connected stages. 18,992 ratings from 217 legal-background evaluators confirm that LegalWorld trajectories are procedurally faithful and role-consistent; and a capability-level cross-model evaluation reveals sharp divergences that aggregate scores cannot expose, with no single backbone leading across consultation, drafting, and courtroom advocacy. Detailed resources will be released publicly.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: