by p12tic 378 days ago
All of this is for batch size 1.

I know. That was my point.

Throughput doesn't scale with batch size on CPU as well as it does on GPU.

We both agree. Batch size 1 is only relevant to people who want to run models on their own private machines, which is the OP's case.
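
For what it's worth, here is a rough roofline-style sketch of the scaling point. It assumes each decode step is limited either by reading the weights once from memory or by the arithmetic for the whole batch, whichever takes longer. The model size and the hardware figures are hypothetical, picked only to show the shape of the curves, and are not measurements of any real CPU or GPU.

```python
def tokens_per_second(batch, weight_bytes, flops_per_token, mem_bw, peak_flops):
    """Roofline-style estimate of decode throughput for one forward step."""
    # Weights are read once per step and shared by every sequence in the batch.
    t_memory = weight_bytes / mem_bw
    # Arithmetic grows linearly with the number of sequences in the batch.
    t_compute = batch * flops_per_token / peak_flops
    return batch / max(t_memory, t_compute)


# Hypothetical 7B-parameter model with 8-bit weights (assumption).
WEIGHT_BYTES = 7e9
FLOPS_PER_TOKEN = 2 * 7e9  # roughly 2 FLOPs per parameter per token

# Hypothetical hardware figures (assumptions, not benchmarks).
CPU = dict(mem_bw=100e9, peak_flops=2e12)
GPU = dict(mem_bw=1000e9, peak_flops=200e12)


def speedup(batch, hw):
    """Throughput at the given batch size relative to batch size 1."""
    base = tokens_per_second(1, WEIGHT_BYTES, FLOPS_PER_TOKEN, **hw)
    return tokens_per_second(batch, WEIGHT_BYTES, FLOPS_PER_TOKEN, **hw) / base


if __name__ == "__main__":
    for b in (1, 8, 64):
        print(b, round(speedup(b, CPU), 1), round(speedup(b, GPU), 1))
```

With these made-up numbers the CPU stops gaining at about 10x over batch size 1, while the GPU is still scaling linearly at batch size 64. At batch size 1 both are bound by memory bandwidth, which is why that is the case that matters for running a model locally.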