| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by GaunterODimm 464 days ago
	2days :/...

1 comments

rob_c 464 days ago

Given I know people running gemma3 on local devices for over almost a month now this is either a very slow news day or evidence of finger missing the pulse... https://blog.google/technology/developers/gemma-3/

link

simonw 464 days ago

This is new. These are new QAT (Quantization-Aware Training) models released by the Gemma team.

link

rob_c 464 days ago

There's nothing more than an iteration on the topic, gemma3 was smashing local results a month ago and made no waves as it dropped...

link

simonw 464 days ago

Quoting the linked story:

> Last month, we launched Gemma 3, our latest generation of open models. Delivering state-of-the-art performance, Gemma 3 quickly established itself as a leading model capable of running on a single high-end GPU like the NVIDIA H100 using its native BFloat16 (BF16) precision.

> To make Gemma 3 even more accessible, we are announcing new versions optimized with Quantization-Aware Training (QAT) that dramatically reduces memory requirements while maintaining high quality.

The thing that's new, and that is clearly resonating with people, is the "To make Gemma 3 even more accessible..." bit.

link

rob_c 464 days ago

As I've said in my lectures on how to perform 1bit training of QAT systems to build classifiers...

"An iteration on a theme".

Once the network design is proven to work yes it's an impressive technical achievement, but as I've said given I've known people in multiple research institutes and companies using Gemma3 for a month mostly saying they're surprised it's not getting noticed... This is just enabling more users but the none QAT version will almost always perform better...

link

simonw 464 days ago

Sounds like you're excited to see Gemma 3 get the recognition it deserves on Hacker News then.

link