AIDB Daily Papers

動的要約生成による解釈可能なマルチモーダルうつ病検出

原題: Dynamic Summary Generation for Interpretable Multimodal Depression Detection

著者: Shiyu Teng, Jiaqing Liu, Hao Sun, Yu Li, Shurong Chai, Ruibo Hou, Tomoko Tateyama, Lanfen Lin, Yen-Wei Chen

公開日: 2026-04-13 | 分野: LLM NLP 音声マルチモーダル医療AI 解釈性 AI 動画テキスト深層学習臨床精神保健

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

大規模言語モデル（LLM）を活用し、正確かつ解釈可能なうつ病検出のための多段階フレームワークを提案した。
臨床症状の主観的な評価による信頼性の低いスクリーニングを、LLMによる詳細な臨床要約で補完し、透明性の高い根拠を提供する。
E-DAICおよびCMDCデータセットでの実験で、精度と解釈可能性の両方において最先端のベースラインを大幅に上回る改善を示した。

Abstract

Depression remains widely underdiagnosed and undertreated because stigma and subjective symptom ratings hinder reliable screening. To address this challenge, we propose a coarse-to-fine, multi-stage framework that leverages large language models (LLMs) for accurate and interpretable detection. The pipeline performs binary screening, five-class severity classification, and continuous regression. At each stage, an LLM produces progressively richer clinical summaries that guide a multimodal fusion module integrating text, audio, and video features, yielding predictions with transparent rationale. The system then consolidates all summaries into a concise, human-readable assessment report. Experiments on the E-DAIC and CMDC datasets show significant improvements over state-of-the-art baselines in both accuracy and interpretability.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2604.11334
カテゴリ: cs.AI

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報