by jsemrau
769 days ago
> The size of the cached internal state of the network processing the book is much larger than the size of the book

It's funny that sometimes people consider LLMs as compression engines. While a lot of information gets lost in each direction (through the neural net)