Y
Hacker News
new
|
ask
|
show
|
jobs
by
arrow7000
1191 days ago
Isn't this exactly how GANs work already?
1 comments
jedberg
1191 days ago
Yes. But from I've seen no one has applied it to the latest Generative AIs.
link
dereg
1190 days ago
I’m pretty sure Anthropic’s Claude is doing that.
https://scale.com/blog/chatgpt-vs-claude
link
arrow7000
1191 days ago
Maybe an adversarial approach was used in training these models in the first place?
link
sharemywin
1190 days ago
It was they were' trained using reinforcement learning with human feedback to create the critic.
link
jedberg
1190 days ago
I hadn't thought about human feedback being an adversarial system, but I guess that makes sense, since it's basically a classifier saying "you got this wrong".
link