| HN Mirror



	by glintik 813 days ago
	How much RAM does a browser need to run an LLM locally?

3 comments

Usually around 5 GB for a 7B 4-bit quantized model.
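For anyone wondering where a figure like that comes from, here is a rough back-of-the-envelope sketch. The function name and the architecture numbers (32 layers, hidden size 4096, a 2048-token context, 16-bit KV cache) are assumptions typical of a Llama-style 7B model, not anything stated in this thread, and real runtimes add their own overhead on top.

```python
def estimate_llm_memory_gb(n_params, bits_per_weight, n_layers,
                           hidden_size, context_tokens, kv_bytes=2):
    """Rough memory estimate in GB (10^9 bytes). Hypothetical helper."""
    # Quantized weights: one value per parameter, bits_per_weight bits each.
    weights = n_params * bits_per_weight / 8
    # KV cache: one key and one value vector per layer per token.
    kv_cache = 2 * n_layers * hidden_size * kv_bytes * context_tokens
    return (weights + kv_cache) / 1e9


# Assumed 7B model shape: 32 layers, hidden size 4096.
weights_only = estimate_llm_memory_gb(7e9, 4, 32, 4096, context_tokens=0)
with_cache = estimate_llm_memory_gb(7e9, 4, 32, 4096, context_tokens=2048)

print(f"weights only:    {weights_only:.1f} GB")
print(f"with 2k context: {with_cache:.1f} GB")
```

Under these assumptions the weights alone come to 3.5 GB and a full 2048-token context adds about 1 GB more, so "around 5 GB" is plausible once quantization scales, activations, and runtime buffers are included.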

Probably less than it needs for a few dozen open tabs, based on my past profiling experience.

How long are you willing to wait?