Hacker News new | ask | show | jobs
by Mars008 320 days ago
> 4 bit quantized 120B model on a 96GB workstation card, the Blackwell Pro workstation

Would be interesting to know how it performs in terms of quality and token/sec.