| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pstorm 987 days ago
	Couldn't you just compare the similarity of the embeddings? I imagine that would work in the vast majority of cases and save a lot of LLM calls.

1 comments

janchorowski 987 days ago

That's a good idea, the deduplication criterion is easy to change, using an llm is faster to get started, but after a while a corpus of decisions is created and can be used to either select another mechanism, or e.g. train one on top of bert embeddings.

link