by curious_cat_163
366 days ago
I am not sure why this ought to require us to "pump another $100 Billion". Could you elaborate? Yes, the more recent generations of GPUs are optimized for attention math, but they are still fairly "general-purpose" accelerators. So when I see papers like this (interesting idea, btw!), my mental model for costs suggests that the CapEx spent on buying GPUs and building out data centers would get reused for this and hundreds of other ideas and experiments. And then the hope is that the best ideas will occupy more of the available capacity...