AIDB Daily Papers

LLM生成テキストに潜む人間らしさの検出と強化

原題: Hidden Human-Like Nature of Machine-Generated Texts: Theory and Detection Enhancement

著者: Chenwang Wu, Yiu-ming Cheung, Bo Han, Defu Lian

公開日: 2026-05-22 | 分野: LLM 機械学習 AI 自然言語処理 cs.CL

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

LLM生成テキストに人間らしい部分が存在することを発見し、それが検出を困難にしていることを理論的に分析しました。
人間らしい部分の影響を低減するモデル非依存のスタック型強化フレームワークを提案しました。
提案手法は既存の検出器を常に強化し、訓練不要なモードでも実用的な柔軟性と拡張性を提供することを示しました。

Abstract

Machine-generated texts (MGTs) produced by large language models (LLMs) are increasingly prevalent across various applications, while their potential misuse in fake news propagation and phishing has raised serious concerns, highlighting the need for MGT detection. Existing paragraph-level detection methods commonly treat MGTs as entirely machine-like, overlooking the hidden human-like nature of machine-generated texts: even fully machine-generated texts may contain spans that are highly consistent with human writing. To this end, we first reveal the existence of such hidden human-like spans, and then theoretically analyze their impact on detection. Our analysis shows that these spans increase the sentence complexity for detection, thereby making MGT detection intrinsically harder. Based on this finding, we propose a model-agnostic stacked enhancement framework that improves existing detectors by reducing the influence of hidden human-like spans. Specifically, we model span-level retention decisions as a latent-variable problem and instantiate the optimization with a hard-EM-inspired procedure, where the detector iteratively filters confidently human-like subsequences and refines itself on the remaining text. Extensive experiments across various LLMs and practical scenarios demonstrate that the proposed framework consistently enhances existing detectors. Notably, the framework can also work in a training-free manner, offering flexibility and scalability for practical deployment.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.23190
カテゴリ: cs.CL

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報