| HN Mirror



	by stereo 471 days ago
	Isn’t this basically the Swiss cheese model? If your two input AIs hallucinate, or your consensus AI misunderstands the input, you will still have confabulations in the output?
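	To put rough numbers on that concern, here is a minimal sketch, assuming the failures are independent (which is optimistic if the workers share training data or prompts). The function name `residual_risk` and the example rates are hypothetical, chosen only for illustration, not measured from any real system.

```python
def residual_risk(p_worker: float, p_judge: float, n_workers: int = 2) -> float:
    """Chance a confabulation reaches the output under an independence assumption.

    Failure is modelled as: every worker hallucinates, OR the judge
    misreads its input. Both rates are illustrative assumptions.
    """
    p_all_workers_fail = p_worker ** n_workers
    # P(A or B) for independent events = 1 - P(not A) * P(not B)
    return 1.0 - (1.0 - p_all_workers_fail) * (1.0 - p_judge)


if __name__ == "__main__":
    # Hypothetical rates: each worker hallucinates 10% of the time,
    # the judge misunderstands 5% of the time.
    print(f"{residual_risk(0.10, 0.05):.4f}")
```

	Under this simple model the judge's own error rate dominates the result, so stacking more workers helps less than making the judge more reliable.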

2 comments

TheKelsbee 471 days ago

I have this same thought, and have tried similar approaches.

OP: Have you trained or fine-tuned a model that specifically reasons about the inputs from the worker models against the user input? Or is this basically just taking a model and turning the temperature down to near 0?


kuberwastaken 470 days ago

Low temperature, plus heavy prompting to answer in a structured way. Sadly I can't fine-tune the models since this is API-based, but the approach does work!


kuberwastaken 470 days ago

Honestly, in all my testing this never happened even once. Plus, the judge model (which I've kept strictly a reasoning model) also evaluates each response individually before "judging" the consensus.
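For readers who want to picture the flow, here is a rough sketch of a two-stage judge: vet each worker response on its own, then form a consensus from the ones that pass. It is not the actual implementation from the project. `Model`, `Verdict`, `judge_consensus`, and the prompt wording are all hypothetical, and the models are plain callables so that any API client (called at low temperature, say) could be wrapped to fit.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical interface: a model is any function from prompt text to reply text.
Model = Callable[[str], str]


@dataclass
class Verdict:
    accepted: List[str]  # worker drafts that passed the individual check
    answer: str          # consensus answer, empty if nothing passed


def judge_consensus(question: str, workers: List[Model], judge: Model) -> Verdict:
    # Collect one draft per worker model.
    drafts = [worker(question) for worker in workers]

    # Stage 1: the judge evaluates each draft individually.
    accepted = []
    for draft in drafts:
        reply = judge(
            f"Question: {question}\nDraft: {draft}\n"
            "Reply with exactly VALID or INVALID."
        )
        if reply.strip().upper() == "VALID":
            accepted.append(draft)

    # If no draft survives, return no answer rather than guessing.
    if not accepted:
        return Verdict(accepted=[], answer="")

    # Stage 2: the judge writes a consensus from the vetted drafts only.
    joined = "\n---\n".join(accepted)
    answer = judge(
        f"Question: {question}\nVetted drafts:\n{joined}\n"
        "Write the consensus answer."
    )
    return Verdict(accepted=accepted, answer=answer)
```

Returning an empty answer when every draft is rejected is a design choice in this sketch; a real system might retry or escalate instead.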
