| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by verdverm 41 days ago
	I run the biggest quant because it is more capable, spark has enough memory for two qwen at 8bit and full context length (roughly 48G each) I find gemini/gemma to have become worse at coding, they are better for non-coding tasks, but maybe not even that, the hallucinations and instruction following have both degraded ime