HN Mirror


	by dools 29 days ago
	Kimi k2.6 is about on par with GPT 5.2, so I’d say open-weight models are about 6 months behind.

2 comments

cbg0 29 days ago

The Q4 quantization requires about 600GB of RAM without context, not exactly consumer hardware friendly.
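As a rough sanity check on that figure, here is a back-of-the-envelope sketch of how weight memory scales with parameter count and bits per weight. The parameter count (about one trillion) and the effective 4.5 bits per weight (4-bit values plus per-block scale overhead) are assumptions for illustration, not numbers stated in this thread, and the estimate covers weights only, not KV cache or runtime buffers.

```python
def quantized_weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Hypothetical inputs, chosen only to illustrate the arithmetic.
ASSUMED_PARAMS = 1.0e12        # assumption: roughly one trillion parameters
ASSUMED_BITS_PER_WEIGHT = 4.5  # assumption: 4-bit values plus scale overhead

weights_gb = quantized_weight_gb(ASSUMED_PARAMS, ASSUMED_BITS_PER_WEIGHT)
print(f"Approximate weight memory: {weights_gb:.1f} GB")  # 562.5 GB
```

Under those assumptions the weights alone come to a bit under 600GB, which is in the same ballpark as the figure above before any context is allocated.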

janderland 29 days ago

Has Kimi found a way to vastly reduce the amount of VRAM required without running at 3 tokens per second? That’s the real concern.

dools 29 days ago

I said "open weight" rather than "local". It is local if you have $240k to drop on GPUs, but you can also run Kimi k2.6 on a B300 cluster for ~$50/hour.
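To put those two numbers side by side, a minimal break-even sketch: how many rented hours cost as much as buying the hardware. It uses only the $240k and ~$50/hour figures from this comment and deliberately ignores power, hosting, depreciation, and resale value, so treat it as a rough comparison rather than a real cost model. The function name is made up for illustration.

```python
def break_even_hours(purchase_usd: float, rental_usd_per_hour: float) -> float:
    """Hours of rental whose total cost equals the up-front purchase price."""
    return purchase_usd / rental_usd_per_hour


# Figures taken from the comment above; everything else is ignored.
hours = break_even_hours(240_000, 50)
days = hours / 24  # assumes the cluster is rented around the clock

print(f"Break-even after {hours:.0f} hours (~{days:.0f} days of continuous use)")
# Break-even after 4800 hours (~200 days of continuous use)
```

So renting only loses out if you keep the cluster busy for more than roughly 200 days of continuous use, under these simplified assumptions.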
