|
|
|
|
|
by karmasimida
957 days ago
|
|
Embedding is poor man's context length increase. It essentially increases your context length but with loss. There is a cost argument to make still, embedding-based approach will be cheaper and faster, but worse result than full text. That being said, I don't see how those embedding startups compete with OpenAI, no one will be able to offer better embedding than OpenAI itself. It is hardly a convincing business. The elephant in the room is the open source models aren't able to match up to OpenAI models, and it is qualitative, not quantitive. |
|
> embedding-based approach will be cheaper and faster, but worse result than full text
I’m not sure results would be worse, I think it depends on the extent to which the models are able to ignore irrelevant context, which is a problem [2]. Using retrieval can come closer to providing only relevant context.
1. https://huggingface.co/spaces/mteb/leaderboard
2. https://arxiv.org/abs/2302.00093