Hacker News new | ask | show | jobs
by riku_iki 1058 days ago
12k for gpt3.
1 comments

It is not bits, but weights
So somehow ascii is less information dense than 12k 32-bit floats per token?