Hacker News new | ask | show | jobs
by mythz 124 days ago
Yep the Qwen team has been churning out models for basically everything, and lets not sleep the big blue whale that started it all which is rumored to have a 1M context drop coming soon [1]

[1] https://www.reddit.com/r/LocalLLaMA/comments/1r1snhv/deepsee...