by HarHarVeryFunny
644 days ago
Base64 encoding is very simple: it just takes each 6 bits of the input and encodes (replaces) them as one of the 64 (2^6) characters A-Za-z0-9+/. If the input is 8-bit ASCII text, then every 3 input characters are encoded as 4 Base64 characters (3 * 8 = 24 bits = 4 * 6-bit Base64 chunks). So this is very similar to an LLM having to deal with tokenized input, but instead of sequences of tokens representing words you've got sequences of Base64 characters representing words.
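To make the 3-bytes-to-4-characters arithmetic concrete, here is a rough sketch in Python. The names `ALPHABET` and `b64_encode` are made up for illustration, and the `=` padding for inputs that aren't a multiple of 3 bytes is the standard Base64 convention rather than something described above.

```python
import base64

# The 64-character alphabet: A-Z, a-z, 0-9, '+', '/'
ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/"

def b64_encode(data: bytes) -> str:
    """Hypothetical hand-rolled encoder: 3 input bytes -> 4 output characters."""
    out = []
    for i in range(0, len(data), 3):
        chunk = data[i:i + 3]
        # Zero-fill a short final chunk so we always have 24 bits to slice.
        padded = chunk + b"\x00" * (3 - len(chunk))
        n = int.from_bytes(padded, "big")
        # Slice the 24 bits into four 6-bit values, most significant first.
        chars = [ALPHABET[(n >> shift) & 0x3F] for shift in (18, 12, 6, 0)]
        # Only len(chunk) + 1 characters carry real data; the rest become '='.
        keep = len(chunk) + 1
        out.append("".join(chars[:keep]) + "=" * (4 - keep))
    return "".join(out)

print(b64_encode(b"Man"))  # TWFu
# Sanity check against the standard library.
print(b64_encode(b"Man") == base64.b64encode(b"Man").decode("ascii"))  # True
```

Note that the same word encodes differently depending on where it lands relative to the 3-byte boundaries, which is part of why the Base64 characters behave more like an awkward tokenization than a simple substitution cipher.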
Maybe "no idea" was a bit much for this example, but any idea certainly didn't come from seeing the matrices themselves fly.