Hacker News new | ask | show | jobs
by macwhisperer 7 days ago
check out a custom 4-bit quant I made today

https://huggingface.co/macwhisperer/Gemma4-12B-SuperDense

should run perfect for 12-16gb with maybe 10-20k context

seems intelligent enough that I would recommend this as a daily driver for friends who just want a local ai that can do most things relatively quickly (getting 10 tps on my m2 air)