Hacker News new | ask | show | jobs
by ipsum2 1100 days ago
Doesn't work on image classifiers, because there's no KV cache. Also, standard image classifiers can do 100-1000 images/sec without any optimizations.
1 comments

It's not really fast if makes intense usage of a GPU
...what?