Hacker News new | ask | show | jobs
by lostmsu 613 days ago
No GPU inference support?
1 comments

> that support fast and lossless inference of 1.58-bit models on CPU (with NPU and GPU support coming next).