by segmondy 135 days ago
A single 6000 should be fine. A Q6_K_XL GGUF will be almost on par with the raw weights and should still leave room for 128k-256k of context.
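
For a rough sanity check of that kind of claim, here is a back-of-the-envelope sketch in Python. It only adds up quantized weight size and KV cache size. Every number in the usage part (parameter count, bits per weight, layer count, KV heads, head dimension, fp16 cache) is a made-up placeholder, not the spec of any particular model or quant, so plug in the values from the model card you actually care about. It also ignores activation buffers and runtime overhead.

```python
GIB = 2**30


def weights_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB."""
    return n_params * bits_per_weight / 8 / GIB


def kv_cache_gib(n_layers: int, n_kv_heads: int, head_dim: int,
                 context_len: int, bytes_per_elem: int = 2) -> float:
    """Approximate KV cache size in GiB for one sequence.

    The leading 2 accounts for storing both keys and values.
    bytes_per_elem=2 assumes an fp16 cache (an assumption, not a given).
    """
    total_bytes = (2 * n_layers * n_kv_heads * head_dim
                   * context_len * bytes_per_elem)
    return total_bytes / GIB


def fits(total_gib: float, vram_gib: float) -> bool:
    """True if the estimate fits in the given VRAM budget."""
    return total_gib <= vram_gib


if __name__ == "__main__":
    # Hypothetical model shape, for illustration only.
    w = weights_gib(n_params=30e9, bits_per_weight=6.5625)
    kv = kv_cache_gib(n_layers=48, n_kv_heads=4, head_dim=128,
                      context_len=128 * 1024)
    print(f"weights ~{w:.1f} GiB, KV cache ~{kv:.1f} GiB, total ~{w + kv:.1f} GiB")
```

Compare the total against the VRAM of your card with `fits(w + kv, vram_gib)`. Doubling the context doubles the KV cache term, which is usually what decides whether 128k or 256k is realistic.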