|
|
|
|
|
by pretendscholar
1177 days ago
|
|
As an amatuer I think Markov chains are explicitly a crude frequency association whereas what exactly a neural network is storing to predict the next token involves stored representations in neural weights which can be far more nuanced. |
|