|
|
|
|
|
by torginus
106 days ago
|
|
I really don't want to overrule your expertise in this regard, but an 5x efficiency gain in a single generation feels like its too much, especially considering how newer process nodes have been yielding less and less improvements. Just to compare and contrast: https://www.videocardbenchmark.net/power_performance.html Here's a synthethic benchmark page listing every GPU in recent memory. True, its not AI, but if we look at the 1080 Ti, a 9 year old card at this point, and compare it with the 5090 we see the gains were 190/74=2.56x in that timespan that involved multiple die shrinks and uArch changes. I think these numbers might not hold up on IRL workloads, and afaict older datacenter cards still hold up well and are being used in production. |
|
E.g. the next gen might have hardware inference for lower bits, more memory bandwidth, etc.