Hacker News new | ask | show | jobs
by ChrisArchitect 510 days ago
Related:

Qwen2.5-Max: Exploring the Intelligence of Large-Scale Moe Model

https://news.ycombinator.com/item?id=42853741