Hacker News new | ask | show | jobs
by qeternity 385 days ago
I know. That was my point.

Throughput doesn't scale on CPU as well as it does on GPU.

1 comments

We both agree. Batch size 1 is only relevant to people who want to run models on their own private machines. Which is the case of OP.