
	by aortega 1233 days ago
	Llama.cpp takes advantage of the fact that LLaMA 7B is a tiny, highly optimized model. It would run on anything, and very fast. I really doubt you can run the 30B or 65B models at acceptable speed on a CPU, at least for a couple of years. (I'm ready to eat my words in a couple of weeks.)