Tag: scale model
-
Hacker News: Qwen2.5-Max: Exploring the Intelligence of Large-Scale Moe Model
Source URL: https://qwenlm.github.io/blog/qwen2.5-max/ Source: Hacker News Title: Qwen2.5-Max: Exploring the Intelligence of Large-Scale Moe Model Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the development and performance evaluation of Qwen2.5-Max, a large-scale Mixture-of-Expert (MoE) model pretrained on over 20 trillion tokens. It highlights significant advancements in model intelligence achieved through scaling…