Y
Hacker News
new
|
ask
|
show
|
jobs
by
riku_iki
1061 days ago
absolutely not. Transformer layers already communicate using embeddings, and ASCII would be absolutely less efficient there.
1 comments
Rhapso
1060 days ago
And how many bits are in an embedded vector?
link
riku_iki
1060 days ago
12k for gpt3.
link
riku_iki
1060 days ago
It is not bits, but weights
link
Rhapso
1058 days ago
So somehow ascii is less information dense than 12k 32-bit floats per token?
link