Y
Hacker News
new
|
ask
|
show
|
jobs
by
colechristensen
40 days ago
No, they just need to be trained to have adversarial self review "thinking" processes.
You ask an LLM "What's wrong with your answer?" and you get pretty good results.
1 comments
binary0010
40 days ago
Or you get the original output result was perfect and the adversarial "rethinking" switches to an incorrect result.
link
byzantinegene
40 days ago
this seems to happen far more than i would like
link