| > "it may be possible to achieve a 100× energy-efficiency advantage" Running the math on a machine with 8x A100 (enough to run today's LLMs) at roughly 300 W per GPU, that would be 300 W * 8 GPUs / 100 = 24 W. That is within striking distance of IoT and personal devices. I'm trying to imagine what a world would look like where generative text models are commoditised to the point where you can either generate text locally on your phone or generate GBs of text in the cloud. I have to admit it's very hard to make any sort of accurate prediction. |
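
For anyone who wants to plug in their own numbers, here is a minimal sketch of the same back-of-envelope estimate. The function name `scaled_power_watts` is made up for illustration, the 300 W per GPU figure is the one assumed in the comment (not a measured value), and the estimate counts GPU power only, ignoring CPU, memory, and cooling.

```python
def scaled_power_watts(watts_per_gpu: float, gpu_count: int, efficiency_gain: float) -> float:
    """Total GPU power divided by a hypothetical energy-efficiency gain."""
    if efficiency_gain <= 0:
        raise ValueError("efficiency_gain must be positive")
    return watts_per_gpu * gpu_count / efficiency_gain


# Assumed inputs from the comment: 8 GPUs at roughly 300 W each.
baseline = scaled_power_watts(300, 8, 1)     # GPU power draw with no efficiency gain
projected = scaled_power_watts(300, 8, 100)  # same workload with the quoted 100x gain

print(f"baseline: {baseline:.0f} W, projected: {projected:.0f} W")
```

Changing `efficiency_gain` to something less optimistic, say 10, gives 240 W, which is closer to a desktop than a phone, so the conclusion depends heavily on whether the full 100× is achievable.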