| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by baq 876 days ago
	IME ollama ran mixtral on a 1070 fast enough.

1 comments

dimask 875 days ago

Though it most probably does not run in on the 1070 but rather on the cpu. It cannot fit on a 1070, it is not about speed, a 1070 cannot run it period.

link

Dkuku 875 days ago

In llama.cpp You can offload some of the layers to gpu with -ngl X. Where x is the number of layers

link