by tomgs 327 days ago
Neat! What’s the use case exactly? Kinda hard to figure out from skimming.

When you have LLM requests you don't mind waiting for (up to 24h), you can save 50% in costs. Great for document processing, image classification at scale, or anything where you don't need an immediate result from the LLM provider and cost matters.
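To put rough numbers on that, here's a back-of-envelope sketch in Python. The request volume and per-request price are made up purely for illustration (not any provider's real pricing), and `batch_savings` is a hypothetical helper; only the 50% discount and the up-to-24h wait come from the comment above.

```python
def batch_savings(num_requests, cost_per_request, discount=0.5):
    """Compare the cost of immediate requests against discounted batch requests.

    discount=0.5 mirrors the 50% saving mentioned above; the other
    inputs are whatever your own workload and pricing look like.
    """
    sync_cost = num_requests * cost_per_request   # pay full price, get answers now
    batch_cost = sync_cost * (1 - discount)       # same work, results within ~24h
    return sync_cost, batch_cost, sync_cost - batch_cost


# Hypothetical workload: 1M documents at an assumed $0.002 per request.
sync_cost, batch_cost, saved = batch_savings(1_000_000, 0.002)
print(f"immediate: ${sync_cost:,.2f}  batch: ${batch_cost:,.2f}  saved: ${saved:,.2f}")
```

The point is just that the absolute saving scales linearly with volume, so it only starts to matter once you're pushing a lot of requests that nobody is waiting on.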
Concrete use cases where 50 percent is actually a thing?