AIDB Daily Papers
生成エージェントベースモデリングにおけるメカニズムの妥当性評価
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- LLMを用いたエージェントベースモデリングにおいて、現象の再現性だけでなく、そのメカニズムの妥当性を評価する尺度を提案した。
- 現象の生成能力と、それがどのように生じるかのメカニズムを区別し、科学哲学の観点からモデルの評価を明確化する。
- 提案されたメカニズム妥当性尺度(Mechanism Plausibility Scale)により、予測モデルと説明モデルの役割が明確になった。
Abstract
Large language models (LLMs) can generate high-level diverse phenomena without explicitly programmed rules. This capability has led to their adoption within different agent-based models (ABMs) and social simulations. Recently, research has aim to test whether they are capable of generating different phenomena of interest, for example, human behavior on social media platforms or performance in game-theoretic scenarios. However, capability, prediction, and explanation are different -- drawing from the philosophy of science and mechanisms literature, textit{explanation} requires showing, to some degree, how a phenomenon is produced by related organized entities and activities. For modelers, describing the characteristics of an experiment or whether a simulation provides progress in capability (or explanation), can be difficult without being grounded in potentially distant research areas. We integrate recent work on LLM-ABMs with contemporary philosophy of science literature and use it to operationalize a definition of `plausibility' in a four-level scale. Our scale separates the evaluation of a model's generative sufficiency (ability to reproduce a phenomenon) from its mechanistic plausibility (how the phenomenon could be produced), and clarifies the distinct roles of different models, such as predictive and explanatory ones. We introduce this as the Mechanism Plausibility Scale.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: