Hacker News new | ask | show | jobs
by phoerious 218 days ago
I'm just an AI researcher, what do I know?
1 comments

> I'm just an AI researcher, what do I know?

me too! what do I know?

(at least now we know where the push for this dreadful policy is coming from)

The whole purpose RLVR alignment is to ensure objectively correct outputs.