Hacker News new | ask | show | jobs
Token-Count-Based Batching: Faster, Cheaper Embedding Inference for Queries (mongodb.com)
1 points by fzliu 175 days ago