by theo31 1808 days ago
There is no cold start! We keep your service hot all the time.

Well, I guess I know where I am going to host GPT-J-6B then. I don't think it is sustainable.
How are you planning to put a GPT-whatever there when the service clearly has a model size limit?!
The size limit is very close to allowing it (12GB model vs. 10GB limit). I imagine you can shrink the model somewhat further and get it to fit.
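For what it's worth, here is a back-of-the-envelope sketch of that arithmetic. It assumes the 12GB figure is the weights stored at 2 bytes per parameter (fp16) and uses a nominal 6 billion parameters. Neither is stated in the thread, and the helper names are made up for illustration. It counts weights only, not activations or runtime overhead.

```python
# Rough weight-only footprint: parameter count x bytes per parameter.
# Assumption: a nominal 6e9 parameters for GPT-J-6B (not stated in the thread).
PARAMS = 6_000_000_000
LIMIT_GB = 10  # the size limit quoted in the thread

# Common storage widths per parameter, in bytes.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weights_gb(params: int, bytes_per_param: int) -> float:
    """Size of the weights alone, in decimal gigabytes."""
    return params * bytes_per_param / 1e9


def fits(params: int, bytes_per_param: int, limit_gb: float = LIMIT_GB) -> bool:
    """True if the weights alone are within the size limit."""
    return weights_gb(params, bytes_per_param) <= limit_gb


if __name__ == "__main__":
    for name, nbytes in BYTES_PER_PARAM.items():
        size = weights_gb(PARAMS, nbytes)
        print(f"{name}: {size:.1f} GB, fits under {LIMIT_GB} GB: {fits(PARAMS, nbytes)}")
```

Under those assumptions fp16 lands at about 12GB, just over the limit, while 1 byte per parameter would come in around 6GB. Whether quantizing that far keeps the output quality acceptable is a separate question.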