AIDB Daily Papers

エージェントが進化させる解釈性ツール：自動研究によるエージェント的イモデリング

原題: Agentic-imodels: Evolving agentic interpretability tools via autoresearch

著者: Chandan Singh, Yan Shuo Tan, Weijia Xu, Zelalem Gero, Weiwei Yang, Michel Galley, Jianfeng Gao

公開日: 2026-05-05 | 分野: LLM 解釈性 AI 自動化 cs.CL cs.AI cs.LG AIエージェント

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

エージェントがデータサイエンス作業の大部分を行う未来に向け、エージェントが解釈可能なデータサイエンスツールを開発する。
予測性能とLLMによる解釈性指標を最適化し、エージェントがモデルの挙動を理解できる文字列表現を持つライブラリを構築した。
開発されたモデルは予測性能とエージェント向け解釈性を向上させ、下流のADSタスクの性能を最大73%向上させた。

Abstract

Agentic data science (ADS) systems are rapidly improving their capability to autonomously analyze, fit, and interpret data, potentially moving towards a future where agents conduct the vast majority of data-science work. However, current ADS systems use statistical tools designed to be interpretable by humans, rather than interpretable by agents. To address this, we introduce Agentic-imodels, an agentic autoresearch loop that evolves data-science tools designed to be interpretable by agents. Specifically, it develops a library of scikit-learn-compatible regressors for tabular data that are optimized for both predictive performance and a novel LLM-based interpretability metric. The metric measures a suite of LLM-graded tests that probe whether a fitted model's string representation is "simulatable" by an LLM, i.e. whether the LLM can answer questions about the model's behavior by reading its string output alone. We find that the evolved models jointly improve predictive performance and agent-facing interpretability, generalizing to new datasets and new interpretability tests. Furthermore, these evolved models improve downstream end-to-end ADS, increasing performance for Copilot CLI, Claude Code, and Codex on the BLADE benchmark by up to 73%

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.03808
カテゴリ: cs.AI, cs.CL, cs.LG

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報