|
|
|
|
|
by xiphias2
171 days ago
|
|
It mainly depends on how much NVIDIA is overselling the improvements. With adding RL functions, separating prefill and decode chips, nvfp4 and lots of other architectural changes efficiency of the most valuable tasks goes up as long as the algorithms don't change significantly. Everything else can just stay on older chips. |
|