Hacker News new | ask | show | jobs
by anthonix1 623 days ago
... which also has a much lower power cap
1 comments

Not that much lower, 295W vs 355W, and for LLM inference VRAM bandwidth is the main bottleneck. But the price is ridiculous.