| HN Mirror



	by GiorgioG 179 days ago
	If only LLMs didn’t just make shit up regularly.

2 comments

ltbarcly3 179 days ago

They both make stuff up and make very obvious misinterpretations of evidence. If you take the output of an LLM and ask another LLM to check it, both problems drop dramatically, even if you use the same LLM but without the existing context. I was able to write a detailed analysis of a rule system by doing this in 3 steps: claude -> chatgpt -> gemini3. It caught all the mistakes, including overstatements and vague statements. It wasn't perfect, but even after one review the number of mistakes or stupid statements was almost 0.
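
A rough sketch of that kind of review chain, for anyone who wants to try it. Everything here is an assumption for illustration: `review_chain`, `REVIEW_INSTRUCTION`, and the stub models are hypothetical names, and each "model" is just a function from prompt to text so you can plug in whatever client you actually use. The key point is that each reviewer sees only the current text, not the earlier conversation.

```python
from typing import Callable, List

# A "model" is any function that maps a prompt string to a response string.
# In practice this would wrap a real client; none is assumed here.
Model = Callable[[str], str]

# Hypothetical review instruction; it must not contain a blank line,
# because the stubs below split the prompt on the first blank line.
REVIEW_INSTRUCTION = (
    "Review the following analysis. Fix factual errors, overstatements, "
    "and vague statements. Return only the corrected text."
)


def review_chain(task: str, author: Model, reviewers: List[Model]) -> str:
    """Draft with one model, then pass the text through each reviewer in turn.

    Each reviewer gets a fresh prompt containing only the current text,
    so it does not inherit the author's context.
    """
    text = author(task)
    for reviewer in reviewers:
        text = reviewer(REVIEW_INSTRUCTION + "\n\n" + text)
    return text


# Stub models standing in for real LLM calls (purely illustrative).
def stub_author(prompt: str) -> str:
    return "The rule ALWAYS applies. Probably."


def stub_reviewer_a(prompt: str) -> str:
    text = prompt.split("\n\n", 1)[1]
    return text.replace("ALWAYS", "usually")  # tone down an overstatement


def stub_reviewer_b(prompt: str) -> str:
    text = prompt.split("\n\n", 1)[1]
    return text.replace(" Probably.", "")  # drop a vague statement


if __name__ == "__main__":
    print(review_chain("Analyze the rule system.", stub_author,
                       [stub_reviewer_a, stub_reviewer_b]))
```

Swapping the stubs for real model calls is the only change needed; the order of reviewers is a choice, not something the chain depends on.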


erichocean 179 days ago

If a coding agent were released that never made anything up, how much would that change things for you?


geophph 179 days ago

I’d save a lot of time by not choosing to smugly tell the AI how wrong it was, just for my own reassurance that, at least for now, I’m still more useful than it is.
