AIDB Daily Papers
AI健康コーチとシミュレーターの相互学習による行動変容支援
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- AI健康コーチとクライアントシミュレーターを相互に共同学習させるフレームワークを提案した。
- 既存手法の片側最適化の限界を克服し、より効果的なAI健康コーチの開発を目指した点が重要である。
- 実験の結果、提案手法が多角的にコーチングの質を向上させることを実証した。
Abstract
Motivational-interviewing-based health coaching is an effective approach for improving mental health and promoting healthy behavior change. However, the scarcity of trained human coaches and the high cost of coaching services make such support inaccessible to many people who could benefit from it. This motivates the development of AI health coaches that can provide scalable and affordable support. Existing methods typically optimize only one side of the interaction: they either train a dialogue agent against a fixed client environment or train a client simulator against a fixed assistant. This one-sided setup can limit exploration of the interaction space and may be inefficient at developing the capabilities required by the target agent and pushing its performance boundaries. In this paper, we propose a dual-agent framework that interactively co-trains both the health coach agent and the client simulator. The coach is optimized with DPO using Pareto-dominant response pairs identified by a multi-dimensional LLM judge. In turn, the client is trained adversarially by reversing these preferences, inducing an implicit adversarial training dynamic. We further show that this co-training process admits a natural stochastic-game interpretation. Extensive experiments demonstrate that our method effectively improves coaching quality across several important dimensions.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: