|
|
|
|
|
by d--b
943 days ago
|
|
You make the asumption that Q* is a LLM, but I think OpenAI guys know very well that the current LLM architecture cannot achieve AGI. As the name suggests, this things is likely using some form of Q learning algorithm, which makes it closer to the DeepMind models than a transformer. My guess is that they pipe their LLM into some Q learnt net. The LLM may transform a natural language task into some internal representation that can then be handled by the Q-learnt model, which spits out something that can be transformed back again into natural language. |
|
https://arxiv.org/abs/2102.04518