Hacker News new | ask | show | jobs
by Esophagus4 1 hour ago
Have you ever let the LLMs “discuss” with each other to see if that would give better answers?

You might end up with the answer from the most persuasive LLM, but you might also end up with better results.

Wonder if there is a paper out there on this.

2 comments

The problem is how do you know whether the answer is just the most persuasive or actually the most accurate one? It's hard to figure this out without domain knowledge.
The problem with trying to write a paper is the results depend on RNG.
That doesn't make it differrnt from any other problem measured by statistical significance in averaged over a big enough series of comparisons, no?