Hacker News new | ask | show | jobs
by retrovrv 895 days ago
Thank you! We have built out the cache system -- we do both simple caching (matching the request strings 100%) and also do semantic caching (returning a cache hit for semantically similar requests). More here - https://portkey.ai/docs/product/ai-gateway-streamline-llm-in...

The caching part isn't open source yet, but part of our internal workers. Would be very cool to open source it!

1 comments

Awesome! We built the simple version in-house, and hoped someone would productize it.