AIDB Daily Papers

SemPiper：機械学習パイプラインにおけるセマンティックオペレータのための対話型コード合成

原題: SemPiper: Interactive Code Synthesis for Semantic Operators in Machine Learning Pipelines

著者: Olga Ovcharenko, Luciano Duarte, Sebastian Schelter

公開日: 2026-06-12 | 分野: LLM 機械学習プログラミング cs.DB cs.LG AI支援

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

機械学習パイプライン開発を支援するため、自然言語指示からコードを生成する新しいプログラミングモデルSemPipesを提案した。
LLMの柔軟性と従来のコードの制御性・最適化可能性を両立させることで、実用的なLLM統合を実現する点が重要である。
データセット特性に基づき、パイプライン学習時にセマンティックオペレータの特殊実装を合成し、対話型インターフェースで可視化・最適化する結果を得た。

Abstract

Machine learning (ML) pipelines require extensive data preparation, feature engineering, and integration across heterogeneous sources, making them tedious and error-prone to develop. While large language models (LLMs) have recently shown promise for assisting programming tasks, chat-based interfaces provide limited control over pipeline behavior and often produce code that is difficult to optimize or integrate into production systems. We demonstrate SemPipes, a novel programming model that extends ML pipelines with declarative, LLM-powered semantic data operators. SemPipes allows developers to specify high-level natural language instructions for data-centric operations, while seamlessly combining these operators with arbitrary Python code from standard data science libraries. For the semantic operators, it synthesizes specialized implementations at pipeline training time, conditioned on dataset characteristics and pipeline context, enabling the flexible yet controlled integration of LLM capabilities. We demonstrate SemPipes through SemPiper, an interactive interface that visualizes computational graphs of the pipelines, synthesized operator implementations, and optimization trajectories produced by an evolutionary search procedure. Attendees can explore three end-to-end scenarios, modify pipelines, inspect generated code, and observe how semantic operators are synthesized and iteratively optimized. The demonstration highlights how declarative semantic operators enable controllable, optimizable, and practical integration of LLMs into ML pipeline development.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2606.14361
カテゴリ: cs.LG, cs.DB

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報