| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by shaial 895 days ago
	Congrats, this looks neat, and surely great to have more TS products in the ecosystem. One plugin or feature that I will like to see in an AI gateway: Cache per unique request. So if I send the same request (system, messages, temperature, etc.), I will have the option to pull if from a cache (if it was already populated) and skip the LLM generation. This is much faster and cheaper - especially during development and testing.

1 comments

retrovrv 895 days ago

Thank you! We have built out the cache system -- we do both simple caching (matching the request strings 100%) and also do semantic caching (returning a cache hit for semantically similar requests). More here - https://portkey.ai/docs/product/ai-gateway-streamline-llm-in...

The caching part isn't open source yet, but part of our internal workers. Would be very cool to open source it!

link

shaial 893 days ago

Awesome! We built the simple version in-house, and hoped someone would productize it.

link