| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by osanseviero 466 days ago

Hi! Omar from the Gemma team here.

Last time we only released the quantized GGUFs. Only llama.cpp users could use it (+ Ollama, but without vision).

Now, we released the unquantized checkpoints, so anyone can quantize themselves and use in their favorite tools, including Ollama with vision, MLX, LM Studio, etc. MLX folks also found that the model worked decently with 3 bits compared to naive 3-bit, so by releasing the unquantized checkpoints we allow further experimentation and research.

TL;DR. One was a release in a specific format/tool, we followed-up with a full release of artifacts that enable the community to do much more.

1 comments

oezi 466 days ago

Hey Omar, is there any chance that Gemma 3 might get a speech (ASR/AST/TTS) release?

link