AIDB Daily Papers

Zero-Day Detection of AI-Generated Malware: LLM-Driven Analysis and a Synergistic Execution Strategy

Original title: Synergistic Directed Execution and LLM-Driven Analysis for Zero-Day AI-Generated Malware Detection

Authors: George Edwards, Mahdi Eslamimehr

Published: 2026-03-10 | Fields: LLM, Transformer, Safety, Security, Machine Learning, AI Risk, Automation, Python, Analysis

Note: The translated title and key points were generated automatically by AI. Please consult the original paper for the exact content.

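Key Points

Proposes a hybrid analysis framework for detecting AI-generated malware that combines concolic execution with LLM-based path prioritization.
For AI-generated malware that evades conventional detection methods, formalizes detection as a logical analysis of program execution traces and proves soundness and relative completeness, assuming classifier correctness.
In experiments, reports 98.7% accuracy on conventional malware and 97.5% on AI-generated malware, outperforming the baselines by 8.4 to 52.2 percentage points on AI-generated samples.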

Abstract

The weaponization of LLMs for automated malware generation poses an existential threat to conventional detection paradigms. AI-generated malware exhibits polymorphic, metamorphic, and context-aware evasion capabilities that render signature-based and shallow heuristic defenses obsolete. This paper introduces a novel hybrid analysis framework that synergistically combines concolic execution with LLM-augmented path prioritization and deep-learning-based vulnerability classification to detect zero-day AI-generated malware with provable guarantees. We formalize the detection problem within a first-order temporal logic over program execution traces, define a lattice-theoretic abstraction for path constraint spaces, and prove both the soundness and relative completeness of our detection algorithm, assuming classifier correctness. The framework introduces three novel algorithms: (i) an LLM-guided concolic exploration strategy that reduces the average number of explored paths by 73.2% compared to depth-first search while maintaining equivalent malicious-path coverage; (ii) a transformer-based path-constraint classifier trained on symbolic execution traces; and (iii) a feedback loop that iteratively refines the LLM's prioritization policy using reinforcement learning from detection outcomes. We provide a comprehensive implementation built upon angr 9.2, Z3 4.12, Hugging Face Transformers 4.38, and PyTorch 2.2, with configuration details enabling reproducibility. Experimental evaluation on the EMBER, Malimg, SOREL-20M, and a novel AI-Gen-Malware benchmark comprising 2,500 LLM-synthesized samples demonstrates that the framework achieves 98.7% accuracy on conventional malware and 97.5% accuracy on AI-generated threats, outperforming ClamAV, YARA, MalConv, and EMBER-GBDT baselines by margins of 8.4 to 52.2 percentage points on AI-generated samples.
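
To make the idea of prioritized path exploration in item (i) concrete, the following is a minimal sketch and not the paper's algorithm. It models a program as a binary tree of branch decisions and compares depth-first search with best-first search driven by a scoring function. Everything here is an assumption made for illustration: the names `dfs_explore`, `guided_explore`, and `toy_score`, the `HINT` pattern, and the target path are hypothetical, and the hand-written scorer merely stands in for an LLM. No angr or Z3 calls are involved.

```python
import heapq
from typing import Callable, List, Tuple

Path = Tuple[bool, ...]  # one branch decision (taken or not) per level

DEPTH = 4
MALICIOUS: Path = (True, False, True, True)  # hypothetical malicious path
HINT: Path = (True, False, True, False)      # hypothetical, imperfect prior


def dfs_explore(depth: int, target: Path) -> int:
    """Count complete paths visited by depth-first search until target is hit."""
    stack: List[Path] = [()]
    explored = 0
    while stack:
        prefix = stack.pop()
        if len(prefix) == depth:
            explored += 1
            if prefix == target:
                return explored
            continue
        # Push True first so that the False branch is popped (visited) first.
        stack.append(prefix + (True,))
        stack.append(prefix + (False,))
    return explored


def toy_score(prefix: Path) -> int:
    """Stand-in for an LLM: count branch decisions that agree with HINT."""
    return sum(1 for taken, hinted in zip(prefix, HINT) if taken == hinted)


def guided_explore(depth: int, target: Path,
                   score: Callable[[Path], int]) -> int:
    """Count complete paths visited when the highest-scoring prefix goes first."""
    # heapq is a min-heap, so scores are negated to pop the best prefix first.
    heap: List[Tuple[int, Path]] = [(-score(()), ())]
    explored = 0
    while heap:
        _, prefix = heapq.heappop(heap)
        if len(prefix) == depth:
            explored += 1
            if prefix == target:
                return explored
            continue
        for taken in (False, True):
            child = prefix + (taken,)
            heapq.heappush(heap, (-score(child), child))
    return explored


if __name__ == "__main__":
    print("DFS paths explored:   ", dfs_explore(DEPTH, MALICIOUS))
    print("Guided paths explored:", guided_explore(DEPTH, MALICIOUS, toy_score))
```

On this toy tree, depth-first search visits 12 complete paths before reaching the target, while the guided search visits 2, even though the hint is wrong about the last branch. These counts describe only this toy setup and are not comparable to the 73.2% reduction reported in the abstract.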

Metadata

arXiv ID: 2603.09044
Categories: cs.CR, cs.SE
