AIDB Daily Papers

気候科学の自律的・開放的分析のためのLLMエージェント「ClimAgent」

原題: ClimAgent: LLM as Agents for Autonomous Open-ended Climate Science Analysis

著者: Hao Wang, Jindong Han, Wei Fan, Hao Liu

公開日: 2026-04-18 | 分野: LLM AI 分析研究 cs.AI 気候

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

気候科学におけるデータとツールの複雑さによる研究のボトルネックを解消するため、LLMを活用した自律型フレームワークを開発しました。
本研究は、LLMを単なる質疑応答を超え、気候科学の専門的な分析やモデリングをエンドツーエンドで実行できるエージェントとして活用する点で新規性があります。
提案手法は、気候科学の現実的な課題を網羅した初のベンチマーク「ClimaBench」において、既存手法を大幅に上回る性能を示しました。

Abstract

Climate research is pivotal for mitigating global environmental crises, yet the accelerating volume of multi-scale datasets and the complexity of analytical tools have created significant bottlenecks, constraining scientific discovery to fragmented and labor-intensive workflows. While the emergence Large Language Models (LLMs) offers a transformative paradigm to scale scientific expertise, existing explorations remain largely confined to simple Question-Answering (Q&A) tasks. These approaches often oversimplify real-world challenges, neglecting the intricate physical constraints and the data-driven nature required in professional climate science.To bridge this gap, we introduce ClimAgent, a general-purpose autonomous framework designed to execute a wide spectrum of research tasks across diverse climate sub-fields. By integrating a unified tool-use environment with rigorous reasoning protocols, ClimAgent transcends simple retrieval to perform end-to-end modeling and analysis.To foster systematic evaluation, we propose ClimaBench, the first comprehensive benchmark for real-world climate discovery. It encompasses challenging problems spanning 5 distinct task categories derived from professional scenarios between 2000 and 2025. Experiments on ClimaBench demonstrate that ClimAgent significantly outperforms state-of-the-art baselines, achieving a 40.21% improvement over original LLM solutions in solution rigorousness and practicality. Our code are available at https://github.com/usail-hkust/ClimAgent.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.16922
カテゴリ: cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報