Hacker News new | ask | show | jobs
by karagenit 218 days ago
Looks cool! Does it support prompt caching? And do you have any data showing how your latency compares to going directly to the model providers? I’m thinking about trying it out but those are my two big reservations.