|
|
|
|
|
by philipkglass
606 days ago
|
|
These quantized models show much less degradation compared to a "vanilla post-training-quantization" but there are a bunch of PTQ schemes that people have already applied to Llama models [1]. I didn't see any details about the vanilla PTQ they used as a baseline. Has it been written about elsewhere? [1] https://ollama.com/library/llama3.2/tags |
|