|
|
|
|
|
by brigadier132
880 days ago
|
|
This analysis is bad. The embedding is generated once. Search is done whenever a user inputs a query. The cosine similarity is also not done on a single embedding, it's done on millions or billions of embeddings if you are not using an index. So what the actual conclusion is, is that once you have a billion embeddings a single search operation costs as much as generating an embedding. But then, you are not even taking into account the massive cost of keeping all of these embeddings in memory ready to be searched. |
|