Hacker News new | ask | show | jobs
by mdagostino 1210 days ago
I wouldn’t say most—maybe a factor of 2. Getting the embedding is still an API call to an LLM.
1 comments

I’m pretty sure they were using a high cost LLM to summarize, and for embeddings you only need Ada, which is orders pf magnitude cheaper.