| HN Mirror



	by blibble 263 days ago
	> The statement that correctness plays no role in the training process is objectively false.

	this statement is objectively false.

1 comments

I'm just an AI researcher, what do I know?

> I'm just an AI researcher, what do I know?

me too! what do I know?

(at least now we know where the push for this dreadful policy is coming from)

The whole purpose of RLVR alignment is to ensure objectively correct outputs.