Hacker News new | ask | show | jobs
by seba_dos1 1171 days ago
> I'm talking about a mental model for understanding and predicting when and why it works well vs when it doesn't.

I'm talking about it too. If I enter a specific phrase into a search engine that can be only found on a handful of websites, I expect it to return those results to me. Like, typing the VAT ID of my company will return bunch of information about it on various sites. This is absolutely not going to work with a LLM - instead, at best it may notice that what you typed looks like a VAT ID and will then proceed to give you information about a company it completely made up. The mental model of understanding what works with LLMs and doesn't is drastically different from a search engine. Human memory on steroids is a much better (though of course still not perfect) model.

1 comments

Again, we seem to be talking past each other, sorry. I'm really, really, really not talking about the search engine itself. I'm talking about the hunk of tech that makes up the interface layer between the human and the search engine, and the fact that that hunk of tech can be hooked up to a search engine is interesting but not entirely germane.

If using the analogy of human memory works for you - that's great! To me, it's not as good a fit, but that's ok.

> The mental model of understanding what works with LLMs and doesn't is drastically different from a search engine

Agreed! But again, that's not what I'm talking about. :)

> I'm talking about a mental model for understanding and predicting when and why it works well vs when it doesn't.

That's what you said earlier you were talking about, and that's what I replied to. Now you're saying that you're in fact not talking about "the mental model of understanding what works with LLMs and doesn't" at all. Seems you have to improve your communication skills mate ;]

What I'm saying is that using LLMs while imagining them to be kinda like search engines is just a way to get burned by hallucinations and disappointed with poor results. They don't work even remotely similar to search engines, neither internally nor for an external observer. For some kinds of input they may trick you into believing they actually do, but that impression will fall apart pretty quickly once you try to actually exercise it. That's how you get people who are genuinely shocked that ChatGPT gave them references to papers that were completely made up, for example - which is something that shouldn't surprise anyone using this tech at all, as that's just how it works.