Hacker News new | ask | show | jobs
by pickleballcourt 91 days ago
While your throughout is around 2x you still cost more then vercel ai model pricing for example for GLM-5: https://vercel.com/ai-gateway/models?q=glm

Is this a result of renting more expensive gpus?

1 comments

Yes, we operate on GB200s and GH200s. Usually we are cheaper for many models and can get up to double the TPS.