Hacker News new | ask | show | jobs
by rubiquity 52 days ago
At 8-bit quantization (q8_0) I get 20 tokens per second on a Radeon R9700.