Hacker News new | ask | show | jobs
by dabinat 78 days ago
Intel’s doing interesting things with their Arc GPUs. They’re offering GPUs that aren’t super fast for gaming but are relatively low power and have a boatload of VRAM. The new B70 is half the retail price of a 5090 (probably more like 1/3rd or 1/4 of actual 5090 selling prices) but has the same amount of memory and half the TDP. So for the same price as a 5090 you could get several and use them together.
1 comments

Is it feasible to run LLM inference comparably without CUDA or Rocm? How much of the cost performance goes away?