| HN Mirror

	by NicoJuicy 311 days ago
	If you have a 24 GB 3090, try out qwen:30b-a3b-instruct-2507-q4_K_M (ollama). It's pretty good.

2 comments

naabb 311 days ago

tbf I also run that on a 16 GB 5070 Ti at 25 tokens/s. It's amazing how fast it runs on consumer-grade hardware. I think you could push up to a bigger model, but I don't know enough about local llama.


jszymborski 311 days ago

Don't need a 3090, it runs really fast on an RTX 2080 too.

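
For a rough sense of how the model relates to the card sizes mentioned in the thread, here is a back-of-the-envelope estimate of weight memory. It is only a sketch: the 30e9 parameter count is read off the "30b" in the model tag, the 4.8 bits per weight for q4_K_M is an assumed average that nobody in the thread states, and KV cache and runtime overhead are ignored. The helper names are made up for illustration.

```python
def weight_gib(params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB."""
    return params * bits_per_weight / 8 / 2**30


def fits_in_vram(vram_gib: float, params: float, bits_per_weight: float) -> bool:
    """True if the weights alone fit in the given amount of VRAM."""
    return weight_gib(params, bits_per_weight) <= vram_gib


TOTAL_PARAMS = 30e9      # nominal, from the "30b" in the model tag
BITS_PER_WEIGHT = 4.8    # ASSUMPTION: rough average for q4_K_M

if __name__ == "__main__":
    size = weight_gib(TOTAL_PARAMS, BITS_PER_WEIGHT)
    print(f"approx. weights: {size:.1f} GiB")
    for label, vram in [("24 GB card", 24), ("16 GB card", 16)]:
        verdict = "fit" if fits_in_vram(vram, TOTAL_PARAMS, BITS_PER_WEIGHT) else "do not fit"
        print(f"{label}: weights alone {verdict}")
```

Under these assumptions the weights fit on a 24 GB card but not entirely on a 16 GB one, so on smaller cards part of the model presumably sits in system RAM. The "a3b" in the tag suggests only about 3B parameters are active per token, which would be one plausible reason it still feels fast there.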