by karmakaze 115 days ago
Chart of how these compare[0] to the Qwen3 235B-A22B, Next-80B-A3B-Thinking, 30B-A3B-Thinking, 4B, and 1.7B models.

These new ones are very much punching above their weights.

[0] https://www.reddit.com/r/LocalLLaMA/comments/1rivckt/visuali...