AIDB Daily Papers

LLMの複数回推論における投票精度曲線：2回の呼び出しでわかること

原題: Two Calls, Two Moments, and the Vote-Accuracy Curve of Repeated LLM Inference

著者: Yi Liu

公開日: 2026-05-05 | 分野: LLM 統計機械学習自然言語処理深層学習アルゴリズム cs.CL cs.LG

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

LLMの推論において、複数回の呼び出しと多数決による精度向上のメカニズムを理論的に分析した。
2回の呼び出しで得られる情報から、例ごとの正答率のばらつきを推定し、確実な改善基準を導き出した。
実験により、温度設定やモデル混合が単一呼び出し精度では予測できない投票精度向上をもたらすことを示した。

Abstract

Repeated sampling is a standard way to spend test-time compute, but its benefit is controlled by the latent distribution of correctness across examples, not by one-call accuracy alone. We study the binary correctness layer of repeated LLM inference under conditional-i.i.d. calls. One labeled call identifies the mean latent success probability; two labeled calls identify its second moment and hence the same-example correctness correlation that separates stable errors from recoverable call-level randomness. From these two moments, every fixed majority-vote budget has a sharp distribution-free two-call interval. The key technical reduction is that the infinite-dimensional moment problem has three-atom extremizers and quadratic dual certificates for every finite budget, so the bounds are exact rather than discretized or parametric. The first useful budget, three votes, has a closed form, width at most $1/8$, and a certified-improvement criterion. The infinite-vote endpoint is the limit of majority voting as the number of calls tends to infinity; it is also sharply bounded, but remains threshold-sensitive because it depends on latent mass around $q=1/2$. We add maximum-entropy and Latent-difficulty Gaussian-probit point completions, and experiments on LLM calls over QNLI and QQP show that empirical three- and five-vote accuracies are contained in the projected two-call regions while temperature changes and randomized model mixtures can create voting gains not ordered by one-call accuracy.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.03379
カテゴリ: cs.LG, cs.CL

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報