AIDB Daily Papers
LLMによる迅速レビューでソフトウェアツール発見を加速:ログ異常検知の事例
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- LLMを活用したスクリーニングとコード生成エージェントによる、ソフトウェアツールの迅速なレビューパイプラインを提案した。
- ログ異常検知ツールを対象に、LLMによるスクリーニングと実行可能性評価で、効率的なツール発見の重要性と新規性を示した。
- 3233件の論文から83件のツールを発見し、24件の実行可能なツールを特定、人間による作業時間を大幅に削減した。
Abstract
In software engineering research, the primary outcome is frequently a tool. However, for practitioners and academics alike, it is hard to tell which tools are maintained and do they work out of the box. In this paper, we propose a pipeline to identify relevant studies with LLM screening, extract the tools presented in them, and run them with LLM-based coding agent. To evaluate the feasibility of our approach we focus on software log anomaly detection tools. We begin the study by designing a broad search string that yields 3233 hits from Scopus. We request two LLMs to provide an inclusion probability for each title-abstract pair according to the inclusion and exclusion criteria. From the 3233 exported abstracts, this screening reduced the number of included papers to 569, out of which we could download 470. These papers included 206 unique links and after manual evaluation we determined 83 to be tools. Finally, we ran the LLM-based coding agent on these 83 links, and got 24 successfully running tools. As replicating our approach would require roughly only 4 hours of human effort, of which 3 hours were manual PDF downloading, and 12 hours of LLM running time, this demonstrates promising efficiency when utilizing LLMs in rapid reviews. Because practitioner-built tools often lack academic papers, in the future we aim to expand our analysis to tool-hosting platforms such as GitHub and PyPI. In the future, we plan to formalize our workflow as LLM Agent Skills to make our approach easier to adopt.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: