Hacker News
by michaelstewart 260 days ago
The lightweight model is incredibly cheap. Cost so far today: $0.06. It's within the same order of magnitude as the per-request cost of the write to Cloudflare KV storage (which I'm using to cache the inference result).