|
|
|
|
|
by faramarz
798 days ago
|
|
it's not about a single point encapsulating a novel, but how sequences of such embeddings can represent complex ideas when processed by the model's layers. each prediction is based on a weighted context of all previous tokens, not just the immediately preceding one. |
|
I suppose that when you each element in the vector weighs 16 bits then the space is immense and capable to have a novel in a point.