Hacker News new | ask | show | jobs
by turblety 384 days ago
Maybe one day they’ll have an actual api that you can pay per token. Right now it’s the standard “talk to us” if you want to use it.
2 comments

Although not obvious, you _can_ pay them per token. You have to use OpenRouter or Huggingface as the inference API provider.

https://cerebras-inference.help.usepylon.com/articles/192554...

Oh, this is cool. Didn’t know they are on openrouter. Thanks.
Interestingly, llama 4 maverick isn't available on that page, only scout.
Yeap looks like it’s just scout and lower.
Huh? Just make an account, get your API key, and try out the free tier.. works for me.

https://cloud.cerebras.ai

Yep, can confirm, I used their API just fine for Llama 4 Scout for weeks now.
> that you can pay per token