AIDB Daily Papers

人間が見る環境を大規模言語モデルで定量化し、メンタルヘルスとの関連を解明

原題: Quantifying the human visual exposome with vision language models

著者: Christian Rominger, Andreas R. Schwerdtfeger, Malay Gaherwar Singh, Dimitri Khudyakow, Elizabeth A. M. Michels, Fabian Wolf, Jakob Nikolas Kather, Magdalena Katharina Wekenborg

公開日: 2026-05-05 | 分野: LLM NLP コンピュータビジョン AI 研究メンタルヘルスデータ分析 cs.AI cs.CV AIエージェント

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

人間の視覚環境を定量化するため、生態学的瞬間評価とVLMを組み合わせた手法を開発した。
この研究は、従来の環境評価手法の限界を超え、個人の視覚体験を客観的に捉える点で重要である。
VLMによる環境特徴量の推定は、個人の感情やストレスレベルと有意な相関を示した。

Abstract

The visual environment is a fundamental yet unquantified determinant of mental health. While the concept of the environmental exposome is well established, current methods rely on coarse geospatial proxies or biased self reports, failing to capture the first person visual context of daily life. We addressed this gap by coupling ecological momentary assessment with vision language models (VLMs) to quantify the semantic richness of human visual experience. Across 2674 participant generated photographs, VLM derived estimates of greenness robustly predicted momentary affect and chronic stress, consistent with established benchmarks. We then developed a semi autonomous large language model (LLM) based pipeline that mined over seven million scientific publications to extract nearly 1000 environmental features empirically linked to mental health. When applied to real world imagery, up to 33 percent of VLM extracted context ratings significantly correlated with affect and stress. These findings establish a scalable objective paradigm for visual exposomics, enabling high throughput decoding of how the visible world is associated with mental health.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.03863
カテゴリ: cs.AI, cs.CV

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報