by gbalduzzi
17 days ago
Aren't small local models worse efficiency-wise? Running them locally means every person needs a machine powerful enough to run a small model, and we are very, very far from that. From an efficiency point of view, the best solution is to run smaller models in datacenters, which requires far fewer machines.
MacBooks have a lot of memory and a lot of FLOPs, and they mostly sit unused all day. Yes, a MacBook will use more energy than a datacenter GPU doing the same work, but you have to generate an absurd number of tokens before the datacenter's dollar-efficiency catches up with the MacBook.
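To make the break-even point concrete, here is a rough sketch of the arithmetic. It assumes the MacBook is already paid for, so its only marginal cost is the extra electricity per token, while the datacenter has to buy hardware up front. The function name, the parameters, and every number in the example call are hypothetical placeholders, not measurements.

```python
JOULES_PER_KWH = 3.6e6  # 1 kWh = 3.6 million joules


def break_even_tokens(hardware_cost_usd, local_joules_per_token,
                      datacenter_joules_per_token, usd_per_kwh):
    """Tokens at which the laptop's extra electricity bill equals the
    up-front hardware cost of the datacenter alternative.

    Assumes the laptop is a sunk cost and ignores cooling, networking,
    and utilization, so this is only an order-of-magnitude estimate.
    """
    if usd_per_kwh <= 0:
        raise ValueError("electricity price must be positive")
    extra_joules = local_joules_per_token - datacenter_joules_per_token
    if extra_joules <= 0:
        raise ValueError("no break-even: local inference is not less efficient")
    # Extra electricity cost of generating one token locally.
    extra_usd_per_token = extra_joules / JOULES_PER_KWH * usd_per_kwh
    return hardware_cost_usd / extra_usd_per_token


if __name__ == "__main__":
    # All inputs below are made-up illustrative values.
    tokens = break_even_tokens(
        hardware_cost_usd=1000.0,
        local_joules_per_token=2.0,
        datacenter_joules_per_token=0.5,
        usd_per_kwh=0.15,
    )
    print(f"break-even after roughly {tokens:.3g} tokens")
```

Plug in your own energy-per-token and hardware figures; the point is only that the per-token electricity difference is tiny compared to the hardware cost, so the break-even count gets very large.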