Hacker News new | ask | show | jobs
by brookman64k 509 days ago
Thank you. Which is currently the most capable version running reasonably fast on a 3090 (24GB of VRAM)?
1 comments

The Llama distilled version Q4_K_M should be reasonably fast and good!!