AIDB Daily Papers

AIポーカー：学習・ソルバー不要でエキスパート級のプレイを実現する「PokerSkill」

原題: PokerSkill: LLMs Can Play Expert-Level Poker without Training or Solvers

著者: Boning Li, Baoxiang Wang, Longbo Huang

公開日: 2026-05-28 | 分野: LLM AI cs.AI AIエージェント cs.GT AI支援

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

人間エキスパートが設計したポーカーのスキルライブラリとLLMを組み合わせることで、学習やソルバーなしでポーカーをプレイするフレームワークを提案した。
この研究は、複雑な不完全情報ゲームにおいて、LLMが学習やソルバーなしで競争力のある性能を発揮できることを初めて示した点で重要である。
提案手法は、既存の最先端GTOベンチマークに対し、損失を大幅に削減し、学習済みの強力なボットをも上回る性能を示した。

Abstract

Poker is a landmark challenge for artificial intelligence. The dominant approach relies on equilibrium solvers built on counterfactual regret minimization, requiring millions of core-hours of training. Large Language Models (LLMs) possess extensive poker knowledge but perform far below solver-based agents when asked to play directly. Traditional rule-based poker agents are interpretable and training-free, but their strategic ceiling remains far below equilibrium play. We introduce textbf{PokerSkill}, a training-free and solver-free framework that bridges this gap by using detailed rule-based poker skills as a structured action-grounding interface for LLMs. A deterministic context engine analyzes the current state and retrieves only the relevant fragments from a layered skill library, which is entirely designed by human poker experts, constraining the LLM's choice to reasonable actions. Against GTOWizard, a state-of-the-art GTO benchmark, GPT-5.5 XHigh with PokerSkill achieves $-57 pm 21$ mbb/hand, Claude Opus 4.6 achieves $-80 pm 29$ mbb/hand and Claude Opus 4.7 achieves $-87pm 64$ mbb/hand, reducing losses by 49--61% compared to default-prompt baselines and outperforming the strong bot Slumbot. Our key finding is that rule-based skills alone do not constitute a strong strategy, and LLMs alone cannot play well, but their combination yields an agent that requires neither training nor solver access yet competes with systems built on millions of core-hours of computation. To our knowledge, this is the first demonstration of an LLM achieving competitive performance in a complex imperfect-information game without game-specific training or solver queries. Code is available at https://github.com/lbn187/PokerSkill.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.30094
カテゴリ: cs.AI, cs.GT

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報