by xiphias2 171 days ago
It mainly depends on how much NVIDIA is overselling the improvements.

With added RL functions, separate prefill and decode chips, NVFP4, and lots of other architectural changes, efficiency on the most valuable tasks goes up, as long as the algorithms don't change significantly.

Everything else can just stay on older chips.