Hacker News new | ask | show | jobs
by lukewarm707 74 days ago
i don't use local models, i just use the APIs of cloud providers (eg fireworks, together, friendli, novita, even cerebras or groq).

you can get subscriptions to use the APIs, from synthetic, or ollama, fireworks.

2 comments

I might be missing it, but does fireworks actually have a subscription? All I saw was serverless (per token) and gpu $/hr.

And since I saw a few other comments talking about these, do you have any preference on different cloud providers with ZDR? I look every once in a while and want to switch to completely open models and/or at least ZDR so I can start doing things like summarizing e-mail. I'm thinking I can probably split my use between some sort of cloud api and claude code for heavier tasks.

on fireworks, log in and go to the dashboard, then the subscription is available under "fire pass" (above the model library). the underlying compute is fireworks and is ZDR.

other cloud providers, kind of depends on the use case. fireworks. inception labs is frontier for the use case. otherwise a lot of the time I use TEE. tinfoil is very good, easy to integrate using their cli proxy, fairly expensive. phala is less expensive but slow, more work to integrate.

Whats the big difference then? You can get a lot of tokens for 20$ and not everything is a state secret i'm doing.

But if i would use some API stuff, probably openrouter, isn't that easer to switch around and also have zero konwledge savety?

i think that privacy is good for wellbeing. it may be this is a dying point of view.
It is for sure but running your own email is so time intense that i gave that up 10 years ago.

i then decided to trust one company with most stuff.

Also as I said, I would use something different for my personal stuff. But i'm waiting for the right hardware etc.