Hacker News new | ask | show | jobs
by pixl97 114 days ago
With the number of operations and the error rate in GPUs this is going to be interesting in SOTA models.
1 comments

Don't forget quantization..