Hacker News new | ask | show | jobs
by LASR 1142 days ago
What people forget is the underlying capability - LLMs are able to do reasoning.

So the one-track thinking of garbage-in-garbage-out is not the limitation any more.

What we're precisely now able to do is garbage-in-less-garbage out.

You can take a vague prompt in and ask GPT to hypothesize on what it means, why the user is asking that question and then generate a detailed prompt. Then use that that prompt and then perform the search.

Trying this out in the playground, I see a suprisingly capable search experience.

1 comments

This kind of second-order (and higher-order) usage of LLMs is where things actually start to get much more interesting. The other thing you can do is just train a better model.

I use GPT-4 for debugging a lot now, because it's excellent at taking nothing other than an error message from the console and giving me back what's wrong and how to fix it. It's not perfect, but it's good enough that I reach for it by default now. I don't have API access to GPT-4 yet, and so I was comparing how well GPT-3.5 performed at this same task and for the example I tried, it just didn't get close enough for me to truly find it useful, so I wouldn't begin to rely on it in my daily workflow unlike GPT-4.

But... what I am actually quite interested in, and what I'm seeing a lot of, is exactly how far can you push a less capable model through prompt engineering? I think it's actually surprisingly further than you might have initially thought.