AIDB Daily Papers

Can LLM agents identify regional varieties of spoken language like a linguist?

Original title: Can LLM Agents Identify Spoken Dialects like a Linguist?

Authors: Tobias Bystrich, Lukas Hamm, Maria Hassan, Lea Fischbach, Lucie Flek, Akbar Karimi

Published: 2026-03-31 | Fields: LLM, NLP, speech, machine learning, linguistics research, natural language processing, deep learning

Note: The Japanese title and key points are automatically generated by AI. Please refer to the original paper for the exact content.

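Key points

This study examines whether LLMs can match the performance of models such as HuBERT in spoken dialect classification.
Although the scarcity of dialectal speech data remains a challenge, the novelty is that combining LLMs with linguistic resources is expected to improve their dialect understanding.
Providing linguistic information improves the LLM's prediction accuracy, and the results suggest that automatically generated transcriptions may be useful for classification.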

Abstract

Due to the scarcity of labeled dialectal speech, audio dialect classification is a challenging task for most languages, including Swiss German. In this work, we explore the ability of large language models (LLMs) as agents in understanding the dialects and whether they can show comparable performance to models such as HuBERT in dialect classification. In addition, we provide an LLM baseline and a human linguist one. Our approach uses phonetic transcriptions produced by ASR systems and combines them with linguistic resources such as dialect feature maps, vowel history, and rules. Our findings indicate that, when linguistic information is provided, the LLM predictions improve. The human baseline shows that automatically generated transcriptions can be beneficial for such classifications, but also presents opportunities for improvement.
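
The approach described in the abstract, combining an ASR-produced phonetic transcription with linguistic resources before querying an LLM, might be sketched roughly as follows. This is only an illustrative sketch and not the authors' implementation: the names `LinguisticResources` and `build_dialect_prompt`, the prompt wording, and the placeholder strings are all assumptions, and no real transcription or dialect data is included.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class LinguisticResources:
    """Hypothetical container for the resource types named in the abstract."""
    feature_map_notes: List[str] = field(default_factory=list)
    vowel_history_notes: List[str] = field(default_factory=list)
    rules: List[str] = field(default_factory=list)


def build_dialect_prompt(
    transcription: str,
    candidate_dialects: List[str],
    resources: Optional[LinguisticResources] = None,
) -> str:
    """Assemble a text prompt; the wording is an assumption, not from the paper."""
    lines = [
        "Identify the dialect of the utterance below.",
        "Candidate dialects: " + ", ".join(candidate_dialects),
        "Phonetic transcription (ASR output): " + transcription,
    ]
    if resources is not None:
        sections = [
            ("Dialect feature map notes", resources.feature_map_notes),
            ("Vowel history", resources.vowel_history_notes),
            ("Rules", resources.rules),
        ]
        for title, notes in sections:
            if notes:  # skip resource types that were not supplied
                lines.append(title + ":")
                lines.extend("- " + note for note in notes)
    lines.append("Answer with exactly one candidate dialect name.")
    return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder inputs only; real transcriptions and rules would come from
    # an ASR system and curated linguistic resources.
    demo = build_dialect_prompt(
        "<phonetic transcription>",
        ["DIALECT_A", "DIALECT_B"],
        LinguisticResources(rules=["<rule placeholder>"]),
    )
    print(demo)
```

Calling the function without `resources` gives a prompt for the plain LLM baseline, while passing them gives the resource-augmented variant; comparing the two mirrors the abstract's finding that predictions improve when linguistic information is provided.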


Metadata

arXiv ID: 2603.29541
Category: cs.CL
