Y
Hacker News
new
|
ask
|
show
|
jobs
by
rightbyte
619 days ago
Running models locally brings my beefy rig to the knees for about half a minute for each querry for smaller models. Answering querries has to be expensive too?