by jychang 140 days ago
No.

128GB of VRAM gets you enough space for models of around 256B parameters (that works out to roughly 4 bits per weight, weights only). But 400B is too big for a single DGX Spark, unless you connect two of them together and use tensor parallelism.
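
For anyone who wants to check the arithmetic, here is a rough back-of-the-envelope sketch. It assumes 4-bit quantized weights and counts weights only (no KV cache, activations, or OS overhead), so real headroom will be tighter. The function names and the 128 GB per-device default are illustrative assumptions, not part of any library.

```python
def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes), weights only."""
    # 1e9 params * (bits / 8) bytes per param = that many GB
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         devices: int = 1, gb_per_device: float = 128.0) -> bool:
    """True if the weights alone fit in the combined memory of `devices` machines.

    Assumes the weights are split evenly across devices (as with tensor
    parallelism) and ignores KV cache and runtime overhead.
    """
    return weight_gb(params_billion, bits_per_weight) <= devices * gb_per_device


if __name__ == "__main__":
    for params, devices in [(256, 1), (400, 1), (400, 2)]:
        size = weight_gb(params, 4)
        print(f"{params}B @ 4-bit on {devices} device(s): "
              f"{size:.0f} GB, fits={fits(params, 4, devices)}")
```

Under these assumptions a 256B model at 4 bits lands right at the 128 GB limit, 400B needs about 200 GB, and two linked machines give 256 GB of combined space.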