| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by arrow7000 1191 days ago
	Isn't this exactly how GANs work already?

1 comments

jedberg 1191 days ago

Yes. But from I've seen no one has applied it to the latest Generative AIs.

link

dereg 1190 days ago

I’m pretty sure Anthropic’s Claude is doing that.

https://scale.com/blog/chatgpt-vs-claude

link

arrow7000 1191 days ago

Maybe an adversarial approach was used in training these models in the first place?

link

sharemywin 1190 days ago

It was they were' trained using reinforcement learning with human feedback to create the critic.

link

jedberg 1190 days ago

I hadn't thought about human feedback being an adversarial system, but I guess that makes sense, since it's basically a classifier saying "you got this wrong".

link