Hacker News new | ask | show | jobs
by Patrick_Devine 76 days ago
Try it with mxfp8 or bf16. It's a decent model for doing tool calling, but I wouldn't recommend using it with 4 bit quantization.