| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by SwellJoe 3 days ago
	DeepSeek is the fastest model in the benchmarks I've been doing (https://swelljoe.com/post/will-it-mythos/). Followed not so closely by Opus 4.8 and even less closely by Gemini 3.5 Flash and GPT 5.5. I've been really impressed with it, so far. It's also among the best at doing the work, though still trailing the frontier models from Anthropic and OpenAI.

1 comments

anschl 2 days ago

Nice benchmark, thanks! Which quants did you choose for the self hosted models?

link

SwellJoe 2 days ago

8-bit on that one (unsloth 8_K_XL). But, the next post compares all common quantizations of Qwen 3.6.

I have another coming in a day or so for Gemma 4 with the 4-bit QAT version, which is very surprising (in a good way, Gemma 4 is impressive for this task).

link