by refulgentis
1190 days ago
I was replying to a comment that said it “seems fine.” It does not seem fine. It is incomprehensible and doesn’t match the results I’ve seen from 7B through 65B. It is true that RLHF could improve it, and perhaps then optimization this severe will seem fine.