Y
Hacker News
new
|
ask
|
show
|
jobs
by
segmondy
135 days ago
1 6000 should be fine, Q6_K_XL gguf will be almost on par with the raw weights and should let you have 128k-256k context.