|
|
|
|
|
by rob_c
417 days ago
|
|
As I've said in my lectures on how to perform 1bit training of QAT systems to build classifiers... "An iteration on a theme". Once the network design is proven to work yes it's an impressive technical achievement, but as I've said given I've known people in multiple research institutes and companies using Gemma3 for a month mostly saying they're surprised it's not getting noticed...
This is just enabling more users but the none QAT version will almost always perform better... |
|