|
|
|
|
|
by Jensson
832 days ago
|
|
I think he disagrees with 4: 4. Language prediction training will not get stuck in a local optimum. Most previous things we train on could have been better served if the model developed AGI, but they didn't. There is no reason to expect LLMs to not get stuck in a local optimum as well, and I have seen no good argument as to why they wouldn't get stuck like everything else we tried. |
|