by cxie 1172 days ago
GPTCache looks like a game-changer for anyone working with LLM services. By caching query results, it cuts down the number of requests and tokens sent to the LLM service, which should substantially reduce overall costs.

If you're leveraging LLMs for your projects, it's definitely worth giving GPTCache a look!
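
For anyone wondering what "caching query results" means in practice, here's a rough sketch of the idea. To be clear, this is not GPTCache's API: `ExactMatchCache` and `fake_llm` are names I made up for illustration, and this toy only matches prompts that are identical after lowercasing and whitespace cleanup.

```python
import hashlib
from typing import Callable, Dict


class ExactMatchCache:
    """Toy exact-match cache in front of an LLM call (hypothetical, not GPTCache)."""

    def __init__(self, llm_call: Callable[[str], str]) -> None:
        self._llm_call = llm_call          # the expensive call we want to avoid repeating
        self._store: Dict[str, str] = {}   # cache key -> stored answer
        self.hits = 0
        self.misses = 0

    def _key(self, prompt: str) -> str:
        # Normalize case and whitespace so trivially different prompts share a key.
        normalized = " ".join(prompt.lower().split())
        return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

    def query(self, prompt: str) -> str:
        key = self._key(prompt)
        if key in self._store:
            self.hits += 1                 # served from cache: no request, no tokens
            return self._store[key]
        self.misses += 1
        answer = self._llm_call(prompt)    # only cache misses reach the LLM service
        self._store[key] = answer
        return answer


def fake_llm(prompt: str) -> str:
    # Stand-in for a real, billed LLM request (assumption for this sketch).
    return f"answer to: {prompt}"


cache = ExactMatchCache(fake_llm)
first = cache.query("What is caching?")
second = cache.query("what is   caching?")  # same prompt after normalization
print(cache.hits, cache.misses)
```

The second call never reaches `fake_llm`, which is where the savings come from: every hit is one request and one prompt's worth of tokens you don't pay for.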