by HyprMusic 544 days ago
Since you're calling out your support for underserved models, can I request that you support some SOTA embedding models? Other providers support embeddings poorly, with only a handful of outdated models and high latency.

Hey, great that you mentioned this. We actually had BAAI/bge-m3 on our list of models to put up in the near future to see if people had use for it over an API. It's great to hear that this is something you're looking for. If there's a specific model you want to run, let us know and we can look into getting it put up soon.
ColBERT and ColQwen are underserved and would benefit from a latency-optimized inference service.
Awesome, we really appreciate the suggestions! We'll look into getting these up and running shortly!