|
|
|
|
|
by mistercow
637 days ago
|
|
I think o1 is a pretty big step in this direction, but the really tricky part is going to be to get models to figure out what they’re bad at and what they’re good at. They already know how to break problems into smaller steps, but they need to know what problems need to be broken up, and what kind of steps to break into. One of the things that makes that problem interesting is that during training, “what the model is good at” is a moving target. |
|