AIDB Daily Papers

SciResearcher：最先端科学推論のための深層リサーチエージェントのスケーリング

原題: SciResearcher: Scaling Deep Research Agents for Frontier Scientific Reasoning

著者: Tianshi Zheng, Rui Wang, Xiyun Li, Yangqiu Song, Tianqing Fang

公開日: 2026-05-02 | 分野: LLM NLP AI 科学研究深層学習 cs.CL cs.AI AIエージェント

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

学術的証拠に基づき、情報収集・ツール利用推論・長期的能力を引き出す自動化されたエージェントフレームワークを開発しました。
従来の手法では困難な、散在する知識や高度な計算・推論を要する最先端科学の課題に対応するため、新たなデータ構築パラダイムを提案します。
開発したSciResearcher-8Bは、HLE-Bio/Chem-GoldベンチマークでSOTAを達成し、既存の大規模エージェントを凌駕する結果を示しました。

Abstract

Frontier scientific reasoning is rapidly emerging as a key foundation for advancing AI agents in automated scientific discovery. Deep research agents offer a promising approach to this challenge. These models develop robust problem-solving capabilities through post-training on information-seeking tasks, which are typically curated via knowledge graph construction or iterative web browsing. However, these strategies face inherent limitations in frontier science, where domain-specific knowledge is scattered across sparse and heterogeneous academic sources, and problem solving requires sophisticated computation and reasoning far beyond factual recall. To bridge this gap, we introduce SciResearcher, a fully automated agentic framework for frontier-science data construction. SciResearcher synthesizes diverse conceptual and computational tasks grounded in academic evidence, while eliciting information acquisition, tool-integrated reasoning, and long-horizon capabilities. Leveraging the curated data for supervised fine-tuning and agentic reinforcement learning, we develop SciResearcher-8B, an agent foundation model that achieves 19.46% on the HLE-Bio/Chem-Gold benchmark, establishing a new state of the art at its parameter scale and surpassing several larger proprietary agents. It further achieves 13-15% absolute gains on SuperGPQA-Hard-Biology and TRQA-Literature benchmarks. Overall, SciResearcher introduces a new paradigm for automated data construction for frontier scientific reasoning and offers a scalable path toward future scientific agents.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.01489
カテゴリ: cs.AI, cs.CL

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報