|
|
|
|
|
by bitL
1123 days ago
|
|
100x is an assumption. LLMs can hit a ceiling (already prohibitively expensive to train), GPUs can hit a ceiling (we are not far from silicon semiconductor limits) and we might end up with a decade of "almost there, but not yet". |
|
There is still relatively low-hanging fruit because this is a very specific application that suddenly has a huge amount of attention. So there are software improvements, model improvements, and hardware improvements. Probably before we even start a truly new paradigm, Nvidia can get close to 10X by focusing on GPT. Model improvements can likely get another order of magnitude.
There are also radically different compute-in-memory paradigms in the pipeline.