Hacker News new | ask | show | jobs
by blibble 216 days ago
> The statement that correctness plays no role in the training process is objectively false.

this statement is objectively false.

1 comments

I'm just an AI researcher, what do I know?
> I'm just an AI researcher, what do I know?

me too! what do I know?

(at least now we know where the push for this dreadful policy is coming from)

The whole purpose RLVR alignment is to ensure objectively correct outputs.