AIDB Daily Papers
AIによる仕事への影響は、モデルの事前知識ではなく証拠に基づいて測定すべき
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- AIによる仕事への影響を、LLMの事前知識のみに頼るのではなく、証拠に基づいた方法で測定することを提唱する。
- 現在の理論的な測定方法は、透明性や外部検証のないラベルを生成するため、政策決定や労働者の将来見通しに影響を与えるには不十分である。
- ニュース記事や学術論文を証拠として利用する検索拡張フレームワークを提案し、従来のゼロショット法よりも実態に即したAI能力の測定が可能であることを示した。
Abstract
This position paper argues that job exposure to AI should be measured with grounded, evidence-based methods, not inferred from LLM priors alone. Current theoretical exposure measures use zero-shot prompting to classify task-level AI exposure, generating labels with no explicit evidence, no transparent chain of reasoning, and no external validation. The stakes of these measurements are too high to rely on such methods, as they influence policy making, where public and private funds are directed, and how workers understand their future prospects. We therefore argue that AI capability claims should meet three standards: reproducibility, external grounding, and inspectability. We propose a retrieval-augmented framework that assigns AI exposure labels to all 18,796 occupation--task pairs in O*NET 30.2, using open-weight reasoning and instruct models with retrieved news articles and academic paper abstracts as evidence of current AI capabilities. Relative to a zero-shot baseline, the grounded condition is preferred in over 72% of disagreement cases under both automatic and human evaluation, and yields scores that align more closely with observed real-world AI usage. Taken together, these findings suggest that evidence-grounded measurement better captures what current AI systems can plausibly do in practice, rather than what a model asserts without external evidence. Because AI capabilities continue to change, the measurements used to inform policy must evolve with them: theoretical AI exposure scores should be periodically reassessed, not inherited as immutable ground truth.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: