Hacker News
by anon373839 453 days ago
That may well be true. I know that earlier models like Llama 1 65B could tolerate more aggressive quantization, which supports that idea.