AIDB Daily Papers
論文インパクトを予測する新手法FAME:科学的評価の未来を拓く
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- 論文の将来的なインパクトを予測する新手法FAMEを提案し、その有効性を検証した。
- 既存の大規模言語モデルはインパクトのある論文を正確に識別できず、動的な科学的評価には限界があった。
- FAMEは論文を動的な潜在空間に投影し、分野の進展方向を捉えることで、LLMを大幅に超える予測精度を示した。
Abstract
Large Language Models (LLMs) are increasingly used to brainstorm and evaluate research ideas, yet assessing such judgments is fundamentally difficult because the true impact of a new idea may take years to emerge. We address this challenge by using the impact forecasting of human-authored manuscripts as a verifiable proxy task. In a prospective forecasting study, we find that frontier LLMs fail to reliably distinguish high-impact papers from ordinary publications, suggesting that static text-based judging is insufficient for scientific evaluation. To address this limitation, we propose $textbf{FAME}$ ($underline{text{F}}$orecasting $underline{text{A}}$cademic Impact via Continuous-Time $underline{text{M}}$anifold $underline{text{E}}$volution), a spatiotemporal framework for modeling the dynamic trajectories of scientific topics. FAME projects papers into a dynamic latent space informed by textual features and a verified knowledge-flow graph, learning geometric constraints that align impactful manuscripts with the forward momentum of their fields. Experiments on 3,200 arXiv papers across three fast-evolving subfields show that FAME consistently and substantially outperforms state-of-the-art LLM evaluators in prospective multidimensional impact forecasting. Furthermore, integrating FAME's dynamic geometric signals into LLMs significantly improves their forecasting performance. These results support manuscript impact forecasting as a useful, measurable proxy benchmark and position FAME as a strong, trajectory-aware foundation for automated scientific evaluation.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: