|
|
|
|
|
by brookst
4 days ago
|
|
IP-protected models manifested directly in silicon. Everything we’re using now is the equivalent of building a GPU on an FPGA: the hardware is general purpose at one abstraction level, and that comes with inefficiency at the next layer up. Collapse the levels, gain efficiency at the cost of generality. |
|
To answer my own question, I bet they could figure out a way to still bill you per-token, if they wanted to.