AIDB Daily Papers

ハードウェアフィードバックによる組み込みAIの反復最適化：Embedded Arena

原題: Embedded Arena: Iterative Optimization via Hardware Feedback

著者: Zhihan Zhang, Alexander Le Metzger, Jiuyang Lyu, Chun-Cheng Chang, Jiayi Shao, Yujia Liu, Emmanuel Azuh Mensah, Edward Wang, Kurtis Heimerl, Gregory D. Abowd, Shwetak Patel, Natasha Jaques, Vikram Iyer

公開日: 2026-06-15 | 分野: AI 最適化ハードウェア cs.AI cs.AR AIエージェント

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

本研究では、組み込みデバイス向けのAIモデルを、メモリや電力などの物理的制約下で自動的に最適化する手法を提案した。
ハードウェアフィードバックを組み込んだLLMエージェントが、モデルとファームウェアを反復的に洗練させることで、従来の手動最適化を超える。
提案手法は、ビジョンモデルで250倍、オーディオモデルで400倍の圧縮を達成し、実用的な応用例として検出カメラやウェアラブルデバイスでの成功を示した。

Abstract

Embedded devices from wildlife monitoring stations to clinical wearables require local AI inference due to latency, communication, or privacy constraints. Optimizing models for heterogeneous microcontrollers (MCUs) requires simultaneously satisfying hard physical constraints on memory, power, and temperature while preserving accuracy, a multidimensional optimization that is today performed manually by experts. We ask whether an LLM agent can autonomously navigate this complex, multi-turn pipeline guided by real hardware feedback, and introduce a hardware-in-the-loop agent arena in which the agent iteratively refines both model and firmware -- compiling, flashing, and measuring on real hardware -- to enable closed-loop optimization. Frontier models, including Claude Opus 4.7 and Gemini 3.1 Pro, fail entirely without hardware feedback (0% deployment success), whereas our hardware-in-the-loop formulation achieves the first successful deployment within three iterations and can surpass human expert results within seven. This agentic co-optimization achieves 250x compression for vision models with <3.3% accuracy loss and 400x for audio with <6% Feature Error Rate loss, enabling battery-free operation on a commercial MCU via solar harvesting. We demonstrate practical impact in two real-world systems: an elk-detection camera trap (96.7% accuracy) and a phonetic-transcription wearable (8.44% FER) for child development research.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2606.16190
カテゴリ: cs.AR, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報