AIDB Daily Papers
MindMirror:デジタルワーカーのためのローカルファーストなマルチモーダル状態認識型サポートシステム
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- デジタルワーカーの疲労や集中力低下を軽減するため、カメラ映像やテキストから状態を認識し、ローカルLLMが応答を生成するシステムを開発した。
- 既存ツールと異なり、ユーザーの状態を能動的に認識・支援し、ローカルファースト設計でプライバシーと制御性を高めた点が重要である。
- 顔表情認識モジュールの精度が大幅に向上し、ユーザーからはローカルファースト設計や対話ワークフローが評価された。
Abstract
Digital workers often experience fatigue, anxiety, reduced attention, and task blockage during prolonged computer-based work. Existing productivity tools mainly focus on task completion, while general-purpose AI chatbots require users to formulate clear prompts before receiving useful help. This paper presents MindMirror, a local-first multimodal state-aware support system for digital workers. MindMirror integrates camera-based facial expression cues, text input, optional speech interaction, structured blockage reflection, local large language model (LLM)-based response generation, and daily/weekly review reports. The system forms a closed workflow of state checking, manual correction, structured articulation, suggestion generation, and state review. The current prototype follows a local-first design, while optional speech services may rely on third-party APIs when enabled. It is implemented with a Web frontend, Flask backend, an emotion recognition model, an Ollama-hosted Qwen model, Chart.js visualization, and local JSON/LocalStorage records. We evaluate the emotion recognition module on an independent seven-class image-level facial expression benchmark containing 6,767 images. The fine-tuned Hugging Face model improves accuracy from 59.66% to 94.49% over a non-fine-tuned checkpoint baseline, an absolute gain of 34.83 percentage points. We further validate the prototype through endpoint-level reliability tests, voice-interaction latency tests, and a small formative user feedback study with six digital workers. Results suggest that users value the local-first design, manual correction mechanism, and structured reflection workflow. MindMirror is not intended for psychological diagnosis; instead, it serves as a lightweight, user-controllable tool for state reflection and supportive interaction.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: