|
|
|
|
|
by wizzwizz4
184 days ago
|
|
A GPT model would be modelled as an n-gram Markov model where n is the size of the context window. This is slightly useful for getting some crude bounds on the behaviour of GPT models in general, but is not a very efficient way to store a GPT model. |
|