Y
Hacker News
new
|
ask
|
show
|
jobs
by
bubblyworld
648 days ago
My barely-informed guess is that they don't have the resources to run it (it's a 200b+ model).
2 comments
regularfry
648 days ago
They could compare to DeepSeek-Coder-V2-Lite-Instruct. That's a 16B model, and it comes out at 24.3 on LiveCodeBench. Given the size delta they're respectably close - they're only just behind at 23.4. The full V2 is way ahead.
link
smcleod
647 days ago
That’s for the larger model, most people running it locally use the -lite model (both of which has lots of benchmarks published)
link