|
|
|
|
|
by ta8645
472 days ago
|
|
Yes, but that just effectively recreates the pretraining. You're going to have to explain everything down to what an atom is, and essentially all human knowledge if you want to have any ability to consider abstract solutions that call on lessons from foreign domains. There's a reason people with comparable intelligence operate at varying degrees of effectiveness, and it has to do with how knowledgeable they are. |
|
This paper claimed transformers learn a gradient-descent mesa-optimizer as part of in-context learning, while being guided by the pretraining objective, and as the parent mentioned, any general reasoner can bootstrap a world model from first principles.
[0] https://arxiv.org/pdf/2212.07677