Y
Hacker News
new
|
ask
|
show
|
jobs
by
petulla
1043 days ago
What's the inference time without gpu?
1 comments
lm2s
1043 days ago
It might the time mentioned at the bottom of the page since the author isn't sure that the GPU is being used:
>How to speed this up—right now my Llama prompts often take 20+ seconds to complete.
link
>How to speed this up—right now my Llama prompts often take 20+ seconds to complete.