AIDB Daily Papers

LLM操縦自在: 大規模言語モデルを操るためのツールキット「AI Steerability 360」

原題: AI Steerability 360: A Toolkit for Steering Large Language Models

著者: Erik Miehling, Karthikeyan Natesan Ramamurthy, Praveen Venkateswaran, Irene Ko, Pierre Dognin, Moninder Singh, Tejaswini Pedapati, Avinash Balakrishnan, Matthew Riemer, Dennis Wei, Inge Vejsbjerg, Elizabeth M. Daly, Kush R. Varshney

公開日: 2026-03-08 | 分野: LLM ベンチマーク AI オープンソースプロンプト開発 Python

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

LLMを操縦するためのPythonライブラリ「AI Steerability 360」を開発し、オープンソースで公開しました。
入力、構造、状態、出力の4つの制御面からモデルを操縦する共通インターフェースを提供し、操縦方法の組み合わせを可能にします。
タスク定義と性能比較のためのベンチマーククラスにより、操縦方法の包括的な評価と比較が容易になりました。

Abstract

The AI Steerability 360 toolkit is an extensible, open-source Python library for steering LLMs. Steering abstractions are designed around four model control surfaces: input (modification of the prompt), structural (modification of the model's weights or architecture), state (modification of the model's activations and attentions), and output (modification of the decoding or generation process). Steering methods exert control on the model through a common interface, termed a steering pipeline, which additionally allows for the composition of multiple steering methods. Comprehensive evaluation and comparison of steering methods/pipelines is facilitated by use case classes (for defining tasks) and a benchmark class (for performance comparison on a given task). The functionality provided by the toolkit significantly lowers the barrier to developing and comprehensively evaluating steering methods. The toolkit is Hugging Face native and is released under an Apache 2.0 license at https://github.com/IBM/AISteer360.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2603.07837
カテゴリ: cs.CL, cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報