AIDB Daily Papers
進化する医療AIエージェント:記憶、反省、改善による診断能力の飛躍
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 本研究では、過去の症例から学習し、診断精度を向上させるEvo-MedAgentを提案した。
- 医師が経験を積むように、症例間の学習を可能にする自己進化型メモリが重要な新規性である。
- Evo-MedAgentは、GPT-5-miniでMCQ精度を0.68から0.79に、Gemini-3 Flashで0.76から0.87に向上させた。
Abstract
Tool-augmented large language model (LLM) agents can orchestrate specialist classifiers, segmentation models, and visual question-answering modules to interpret chest X-rays. However, these agents still solve each case in isolation: they fail to accumulate experience across cases, correct recurrent reasoning mistakes, or adapt their tool-use behavior without expensive reinforcement learning. While a radiologist naturally improves with every case, current agents remain static. In this work, we propose Evo-MedAgent, a self-evolving memory module that equips a medical agent with the capacity for inter-case learning at test time. Our memory comprises three complementary stores: (1)~emph{Retrospective Clinical Episodes} that retrieve problem-solving experiences from similar past cases, (2)~an emph{Adaptive Procedural Heuristics} bank curating priority-tagged diagnostic rules that evolves via reflection, much like a physician refining their internal criteria, and (3)~a emph{Tool Reliability Controller} that tracks per-tool trustworthiness. On ChestAgentBench, Evo-MedAgent raises multiple-choice question (MCQ) accuracy from 0.68 to 0.79 on GPT-5-mini, and from 0.76 to 0.87 on Gemini-3 Flash. With a strong base model, evolving memory improves performance more effectively than orchestrating external tools on qualitative diagnostic tasks. Because Evo-MedAgent requires no training, its per-case overhead is bounded by one additional retrieval pass and a single reflection call, making it deployable on top of any frozen model.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: