AIDB Daily Papers

後悔最小化による適応的インセンティブ設計：情報不足下での最適戦略

原題: Adaptive Incentive Design with Regret Minimization

著者: Georgios Vasileiou, Lantian Zhang, Silun Zhang

公開日: 2026-04-07 | 分野: 強化学習ゲーム機械学習最適化制御数理モデルシステム経済理論アルゴリズム

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

本研究では、情報非対称性下でインセンティブメカニズムを設計する新しい問題RAIDを提案した。
RAIDアルゴリズムは、探索と活用を切り替えることで、完全情報を持つ場合に匹敵する後悔を漸近的に最小化する。
提案手法は、タイプ推定の強一致性を示し、プランナーの平均後悔をほぼ確実に漸近的に最小化することを証明した。

Abstract

Incentive design constitutes a foundational paradigm for influencing the behavior of strategic agents, wherein a system planner (principal) publicly commits to an incentive mechanism designed to align individual objectives with collective social welfare. This paper introduces the Regret-Minimizing Adaptive Incentive Design (RAID) problem, which aims to synthesize incentive laws under information asymmetry and achieve asymptotically minimal regret compared to an oracle with full information. To this end, we develop the RAID algorithm, which employs a switching policy alternating between probing (exploration) and estimate-based incentivization (exploitation). The associated type estimator relies only on a weaker excitation condition required for strong consistency in least squares estimation, substantially relaxing the persistence-of-excitation assumptions previously used in adaptive incentive design. In addition, we establish the strong consistency of the proposed type estimator and prove that the incentive obtained asymptotically minimizes the planner's average regret almost surely. Numerical experiments illustrate the convergence rate of the proposed methodology.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.05977
カテゴリ: math.OC, cs.GT, cs.MA, eess.SY

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報