by sourcecodeplz 14 days ago
$1.5k per month for SOTA. With 128 GB you can run DSV4 Flash.
1 comment

What's the point of running it locally, though? Inference for open models is already quite cheap. They could just self-host anyway. The experience of running LLMs locally will be excruciatingly bad in comparison, at least for the near future.