Hacker News new | ask | show | jobs
by YetAnotherNick 143 days ago
Hosting the model is cheaper per token, the more batched token you get. So they have big advantage here.