Hacker News
by benbojangles 20 days ago
I run gemma-4-26b-bf16 in MTP mode and it runs very smoothly, producing answers in seconds and outputting text 30x faster than I can read.