by weatherlite
169 days ago
> None of the stated problems are actually issues with LLMs after on-policy training is performed

But still, isn't it a major weakness that they have to do RL on everything that doesn't have much data? That really weakens the attempt to make it true AGI.

AGI would be a universal learner, not a magic genie. It still needs to learn (via RL or otherwise) in order to do new tasks.