Hacker News new | ask | show | jobs
by blibble 218 days ago
> I'm just an AI researcher, what do I know?

me too! what do I know?

(at least now we know where the push for this dreadful policy is coming from)

1 comments

The whole purpose RLVR alignment is to ensure objectively correct outputs.