by petulla 1043 days ago
What's the inference time without a GPU?
1 comment

It might be the time mentioned at the bottom of the page, since the author isn't sure that the GPU is being used:

>How to speed this up—right now my Llama prompts often take 20+ seconds to complete.