|
|
|
|
|
by mov_eax_ecx
1014 days ago
|
|
How can i locate this study?. I think you are misrepresenting something. In the gpt4 paper they specifically address this, and find that "Averaged across all exams, the base model achieves a score of 73.7% while the RLHF model achieves a score of 74.0%, suggesting that post-training does not substantially alter base model capability." |
|