AIDB Daily Papers
LLMの幻覚が論文に蔓延:存在しない引用から大規模な証拠を発見
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- LLMが生成する架空の引用が、arXivなどのプレプリントサーバーで急増していることを大規模調査で明らかにした。
- LLMの普及に伴い、特にAI研究分野やAI支援執筆の痕跡がある論文で、架空引用が顕著に増加している。
- 架空引用は既存の不平等を助長する可能性があり、論文の信頼性と科学的認識の公平性を脅かしている。
Abstract
Large language models (LLMs) are known to generate plausible but false information across a wide range of contexts, yet the real-world magnitude and consequences of this hallucination problem remain poorly understood. Here we leverage a uniquely verifiable object - scientific citations - to audit 111 million references across 2.5 million papers in arXiv, bioRxiv, SSRN, and PubMed Central. We find a sharp rise in non-existent references following widespread LLM adoption, with a conservative estimate of 146,932 hallucinated citations in 2025 alone. These errors are diffusely embedded across many papers but especially pronounced in fields with rapid AI uptake, in manuscripts with linguistic signatures of AI-assisted writing, and among small and early-career author teams. At the same time, hallucinated references disproportionately assign credit to already prominent and male scholars, suggesting that LLM-generated errors may reinforce existing inequities in scientific recognition. Preprint moderation and journal publication processes capture only a fraction of these errors, suggesting that the spread of hallucinated content has outpaced existing safeguards. Together, these findings demonstrate that LLM hallucinations are infiltrating knowledge production at scale, threatening both the reliability and equity of future scientific discovery as human and AI systems draw on the existing literature.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: