|
|
|
|
|
by dist-epoch
1155 days ago
|
|
> The input string is tokenized into a sequence of token indices (integers) How is this tokenization done? Sometimes a single word can be two tokens. My understanding is that the token indices are also learned, but by whom? The same transformer? Another neural network? |
|