Hacker News new | ask | show | jobs
by visarga 221 days ago
Solve the same task with ChatGPT, Gemini and Claude. If they agree, you can be reasonably sure.
1 comments

I'm not opposed to experimenting, but that's a a recipe for false confidence in a final decision.
Where they agree it shows the data supports that answer - not necessarily that it is true, where they disagree it shows you need to hedge. That's useful.
This is so wrong!

e.g., if you had a heart condition, you can't just poll three LLMs and be "reasonably sure" you've properly diagnosed the ailment.