| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by bubblyworld 648 days ago
	My barely-informed guess is that they don't have the resources to run it (it's a 200b+ model).

2 comments

regularfry 648 days ago

They could compare to DeepSeek-Coder-V2-Lite-Instruct. That's a 16B model, and it comes out at 24.3 on LiveCodeBench. Given the size delta they're respectably close - they're only just behind at 23.4. The full V2 is way ahead.

link

smcleod 647 days ago

That’s for the larger model, most people running it locally use the -lite model (both of which has lots of benchmarks published)

link