by wongarsu
362 days ago
We would presumably stop calling it an LLM somewhere along the way. But I don't see why it couldn't have a transformer architecture at the heart of it, or why that transformer couldn't be pretrained on Reddit. You would have to tack a lot of stuff on to allow an internal stream of consciousness, interaction with the world, and memory, and you would need significant reinforcement learning. But we are already doing all of that while still calling the thing an LLM. It's unclear to me where the border lies beyond which it would cease to be an LLM.