| HN Mirror



	by simianwords 407 days ago
	> Right now, most LLMs with web search grounding are still in Stage 1: they can retrieve content, but their ability to assess quality, trustworthiness, and semantic ranking is still very limited.

	Why do you think it is limited? Imagine you show an LLM a link with its details and ask whether it is trustworthy or high quality w.r.t. the query. Why can't it answer that?

2 comments

lgiordano_notte 407 days ago

I don't think the limit is in what LLMs can evaluate. Given the right context, they're good at assessing quality. The problem is what actually gets retrieved and surfaced in the first place. If the upstream search doesn't rank high-quality or relevant material well, the LLM never sees it. It's less a judgment problem than a selection problem.


RainbowcityKun 407 days ago

What I mean is that stronger engineering is needed around how search results are processed before they are handed to the LLM.


simianwords 407 days ago

Not sure I understand -- LLMs are pretty good at assessing the quality of search results. If an LLM can bulk-assess a bunch of results, it can get pretty far, and probably more efficiently than a human hand-checking all the results.
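
To make the two positions in this thread concrete, here is a minimal Python sketch of "bulk assess a bunch of results" sitting behind an upstream search step. Everything in it is an assumption for illustration: `Result`, `upstream_search`, `bulk_assess`, and `toy_judge` are hypothetical names, the URLs are placeholders, and `toy_judge` is a keyword-overlap stand-in for an actual LLM call, not a real model or API.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Result:
    url: str
    snippet: str


def upstream_search(corpus: List[Result], k: int) -> List[Result]:
    """Hypothetical search engine: returns the first k items in its own ranking."""
    return corpus[:k]


def bulk_assess(
    query: str,
    results: List[Result],
    judge: Callable[[str, Result], float],
) -> List[Result]:
    """Score every retrieved result with the judge and return them best-first."""
    scored = [(judge(query, r), r) for r in results]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [r for _, r in scored]


def toy_judge(query: str, result: Result) -> float:
    """Placeholder for an LLM call: fraction of query words found in the snippet."""
    words = query.lower().split()
    text = result.snippet.lower()
    return sum(w in text for w in words) / len(words)


# Assumed toy corpus, listed in the order the upstream engine ranks it.
corpus = [
    Result("https://example.com/a", "ten weird tricks for search"),
    Result("https://example.com/b", "listicle about ranking hacks"),
    Result("https://example.com/c", "how llm search grounding ranks sources"),
]
query = "llm search grounding"

# With k=2 the best page is never retrieved, so the judge cannot surface it.
top2 = bulk_assess(query, upstream_search(corpus, k=2), toy_judge)
# With k=3 the judge sees it and ranks it first.
top3 = bulk_assess(query, upstream_search(corpus, k=3), toy_judge)

print([r.url for r in top2])
print([r.url for r in top3])
```

In this toy setup the judge is perfectly capable of picking the best page, but only when the upstream cutoff lets that page through, which is the selection-versus-judgment distinction raised above.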
