AIDB Daily Papers

LLMによるOracleからPostgreSQLへの移行におけるトークン最適化戦略

原題: Token Optimization Strategies for LLM-Based Oracle-to-PostgreSQL Migration

著者: Oleg Grynets, Dmytro Babarytskyi, Vasyl Lyashkevych

公開日: 2026-05-27 | 分野: LLM 最適化データベース cs.AI

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

本研究では、LLMを用いたOracleからPostgreSQLへのデータベース移行におけるトークン消費の課題を解決するため、12種類のトークン最適化戦略を提案・評価した。
既存手法はトークン消費が多く、文脈の劣化や意味のずれが生じるため、コストと品質のバランスを取る最適化が重要である。
適応的ルーティングはトークン数を削減しつつ意味の整合性を維持し、最も実用的なトレードオフを示した。

Abstract

LLMs are increasingly used for software modernization, code translation, and database migration. However, LLM-based Oracle2PostgreSQL migration remains constrained by high token consumption, long-context degradation, dialect-specific semantic differences, and the risk of semantic drift during query transformation. Direct inclusion of large Oracle SQL/PL-SQL artefacts, schema definitions, procedural logic, and migration instructions into the model context increases cost and may reduce generation quality. This paper shows token optimization as a constrained transformation problem in LLM-based Oracle2PostgreSQL migration. The study formalizes and evaluates twelve token optimization strategies: baseline representation, context pruning, minification, DSL-based semantic compression, metadata augmentation, context refactoring, schema distillation, adaptive routing, AST-based minification, identifier masking, output constraint enforcement, and hybrid optimization. The strategies are evaluated on samples of 10 and 100 Oracle SQL queries using Valid Syntax Rate, Exact Match, Semantic Match, CodeBLEU, and Token Efficiency. The results show that mild context pruning preserves semantic quality almost at the baseline level, achieving 89.75% Semantic Match on the 100-query sample compared with 89.80% for the unoptimized baseline. Adaptive routing provides the best practical trade-off, reducing input tokens by 8.72% and output tokens by 5.49% while maintaining 88.40% Semantic Match and increasing Token Efficiency by 6.67%. Aggressive schema distillation increases Token Efficiency by 132.22% but results in a 44.50-percentage-point decrease in Semantic Match. The findings demonstrate that token optimization cannot be treated as simple prompt shortening; it must be evaluated as a multi-objective migration problem balancing cost, syntactic validity, semantic preservation, and structural fidelity.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.28557
カテゴリ: cs.LO, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報