AIDB Daily Papers

組み込み・IoTシステム開発を支援する熟練AIエージェント

原題: Skilled AI Agents for Embedded and IoT Systems Development

著者: Yiming Li, Yuhan Cheng, Mingchen Ma, Yihang Zou, Ningyuan Yang, Wei Cheng, Hai "Helen" Li, Yiran Chen, Tingjun Chen

公開日: 2026-03-20 | 分野: LLM ベンチマーク AI ソフトウェアエージェント自動化開発実験システム IoT 組み込みハードウェア

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

LLMとエージェントシステムを応用し、ハードウェアと連携した組み込み・IoTシステム開発の自動化を目指す研究。
ソフトウェアとハードウェアの密結合により、実機での動作検証が不可欠な領域で、AIエージェントの性能評価が重要となる。
スキルベースのエージェントフレームワークとベンチマーク「IoT-SkillsBench」を提案し、専門知識が成功率向上に寄与することを示した。

Abstract

Large language models (LLMs) and agentic systems have shown promise for automated software development, but applying them to hardware-in-the-loop (HIL) embedded and Internet-of-Things (IoT) systems remains challenging due to the tight coupling between software logic and physical hardware behavior. Code that compiles successfully may still fail when deployed on real devices because of timing constraints, peripheral initialization requirements, or hardware-specific behaviors. To address this challenge, we introduce a skills-based agentic framework for HIL embedded development together with IoT-SkillsBench, a benchmark designed to systematically evaluate AI agents in real embedded programming environments. IoT-SkillsBench spans three representative embedded platforms, 23 peripherals, and 42 tasks across three difficulty levels, where each task is evaluated under three agent configurations (no-skills, LLM-generated skills, and human-expert skills) and validated through real hardware execution. Across 378 hardware validated experiments, we show that concise human-expert skills with structured expert knowledge enable near-perfect success rates across platforms.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2603.19583
カテゴリ: cs.SE, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報