| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mov_eax_ecx 1014 days ago
	How can i locate this study?. I think you are misrepresenting something. In the gpt4 paper they specifically address this, and find that "Averaged across all exams, the base model achieves a score of 73.7% while the RLHF model achieves a score of 74.0%, suggesting that post-training does not substantially alter base model capability."

3 comments

The problem with these studies is that we really still don’t know. Nobody can replicate the papers of OpenAI.

Found it, it is a pretty recent paper.

Given the homogeneity of responses on taboo subjects, there's probably something exogenous to the model at work.