Hacker News
by
pickleballcourt
91 days ago
While your throughput is around 2x, you still cost more than Vercel AI Gateway model pricing, for example for GLM-5:
https://vercel.com/ai-gateway/models?q=glm
Is this a result of renting more expensive GPUs?
1 comment
2uryaa
90 days ago
Yes, we operate on GB200s and GH200s. Usually we are cheaper for many models and can get up to double the TPS.