Hacker News
by nrp 645 days ago
How are you finding 2-bit/3-bit quantized Llama 405B? Is it behaving better than 8-bit or 16-bit Llama 70B?