| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Layvier 936 days ago
	I'd be curious to hear what people think is state of the art of this kind of problem? I think the Cohere embeddings model v3 is very good, as it handles queries and documents differently to embed them in the same vector space. Otherwise for a specific use case I guess dense retrieval (which is basically a problem specific fine tuned version of this approach) is the best way to go about it?

3 comments

teaearlgraycold 936 days ago

> it handles queries and documents differently to embed them in the same vector space

I assume this might be behind the state of the art, then. As of a year ago OpenAI managed to get an embeddings model that can do both without any special flag or dual model to treat queries and documents differently.

link

mahjongmen1998 925 days ago

handling queries and documents differently leads to an increase in overall retrieval.

link

dmezzetti 936 days ago

If you're talking about embeddings model metrics, the MTEB Leaderboard is a good resource: https://huggingface.co/spaces/mteb/leaderboard

link

Layvier 936 days ago

Yes I had a look at it, I haven't tried e5-mistral-7b-instruct yet but I'll definitely give it a go. Is there such a leaderboard only focused on retrieval by any chance? I haven't found one so far

link

dmezzetti 936 days ago

The BEIR project might be what you're looking for: https://github.com/beir-cellar/beir/wiki/Leaderboard

link

pzar 935 days ago

https://huggingface.co/spaces/mteb/leaderboard

see the retrieval tab

link

viraptor 936 days ago

It's interesting how the massive model of e5-mistral has only marginal performance gains over the bge-base and similar ones. It could still be useful for the longer sentence length though.

link

mahjongmen1998 925 days ago

e5-mistral is essentially a distillation from gpt-4 to a smaller model. You can see here https://github.com/microsoft/unilm/blob/16da2f193b9c1dab0a69...

they actually have custom prompts for each dataset being tested.

Question would be, if you haven't seen the task before, what is a good prompt to prepend for your task?

IMO e5-mistral is overfit to MTEB

link

charcircuit 936 days ago

>I'd be curious to hear what people think is state of the art of this kind of problem?

I would assume Amazon's product suggestion would be SotA for reccomending a book based off of another. It is a recommondation system and while it uses semantic search there are many more ranking signals it uses.

link

lukev 936 days ago

The problem with systems like that is they're going to be optimized for making Amazon the most money, not necessarily what's best for the user (unless the two happen to coincide.)

link

charcircuit 936 days ago

Money is a proxy for user value and it is in Amazon's best interest for the customer to continually act upon these recommendations. If Amazon fails to deliver user value customers will be less willing to continue paying.

link

boolemancer 936 days ago

> Money is a proxy for user value and it is in Amazon's best interest for the customer to continually act upon these recommendations. If Amazon fails to deliver user value customers will be less willing to continue paying.

You're talking about the same Amazon that will blindly give me recommendations for other products that fill exactly the same niche as the one I just bought.

"Oh, you just bought a generator? You probably need a second one too, right?"

link

lukev 936 days ago

lol. lmao.

link