| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by throwawaydummy 692 days ago
	I wanna meet the person who greps die, kick the bucket and buy the farm lol Are models like mistral there yet in terms of token per second generation to run a grep over millions of files?

1 comments

ignoramous 692 days ago

Mistral has published large language models, not embedding models? sgrep uses Google's Word2Vec to generate embeddings of the corpus and perform similarity searches on it, given a user query.

link

throwawaydummy 692 days ago

No I got that I asked because wouldn’t embedding generated by fine tuned transformer based LLMs be more context aware? Idk much about the internals so apologies if this was a dumb thing to say

link

ignoramous 692 days ago

embeddings come in handy to augment LLMs [0], but as you suspect, some try LLMs themselves as an outright embedding model with varying degrees of success: https://www.reddit.com/r/LocalLLaMA/comments/12y3stx/embeddi... / https://huggingface.co/spaces/mteb/leaderboard

[0] https://simonwillison.net/2023/Oct/23/embeddings/

link