Y
Hacker News
new
|
ask
|
show
|
jobs
by
MallocVoidstar
872 days ago
That isn't unquantized, it's
de
-quantized. They went from Q5 to fp16 for use in Pytorch instead of the GGUF ecosystem.
2 comments
Taek
872 days ago
I never thought people would be upscaling models by increasing quantization precision. The rationale makes sense bit its also a goofy outcome.
link
nullc
872 days ago
You should be able to upscale and fine tune to recover performance, I suppose!
Clearly we should train a diffusion model to denoise the weights of LLM transformer models. Yo dawg.
link
throwaway9274
872 days ago
Yes, that’s correct. Good correction.
link