|
|
|
|
|
by Samin100
420 days ago
|
|
I have a few private “vibe check” questions and the 4 bit QAT 27B model got them all correctly. I’m kind of shocked at the information density locked in just 13 GB of weights. If anyone at Deepmind is reading this — Gemma 3 27B is the single most impressive open source model I have ever used. Well done! |
|
I think this means I either have to train the -pt model with my own instruction tuning or use another provider :(