AIDB Daily Papers
LLMオープンソース界の破壊的モデルを特定する:技術革新の震源地はどこか?
※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。
ポイント
- Hugging Face上の250万件以上のモデルを分析し、LLMの系譜ネットワークを再構築しました。
- 既存技術を強化するモデルと、新たな開発基盤となるモデルを区別する指標(MDI)を導入しました。
- 大規模モデルやファインチューニング戦略が、破壊的なモデルを生み出す可能性が高いことが示唆されました。
Abstract
The rapid growth of open-source large language models (LLMs) has created a complex ecosystem of model inheritance and reuse. However, existing research has focused mainly on descriptive analyses of lineage evolution, with limited attention to identifying which models play a disruptive role in shaping subsequent development. Using metadata from 2,556,240 models on Hugging Face, this study reconstructs a large-scale lineage network and introduces the Model Disruption Index (MDI) to distinguish between models that reinforce existing technological trajectories and those that become new bases for later development. The results show that most models in the open-source LLM community are consolidative rather than disruptive, reflecting a highly concentrated and path-dependent evolutionary structure. Further analyses suggest that disruptive positions are more likely to emerge among large-scale models and through finetuning strategies. Overall, this study provides a new perspective for identifying disruptive models and understanding uneven technological development in open-source LLM ecosystems.
Paper AI Chat
この論文のPDF全文を対象にAIに質問できます。
質問の例: