Y
Hacker News
new
|
ask
|
show
|
jobs
by
NicoJuicy
264 days ago
If you have a 24 gb 3090. Try out qwen:30b-a3b-instruct-2507-q4_K_M ( ollama )
It's pretty good.
2 comments
naabb
264 days ago
tbf I also run that on a 16GB 5070TI at 25T/S, it's amazing how fast it runs on consumer grade hardware. I think you could push up to a bigger model but I don't know enough about local llama.
link
jszymborski
264 days ago
Don't need a 3090, it runs really fast on an RTX 2080 too.
link