| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Twirrim 125 days ago
	Just in case anyone hasn't seen this yet: https://github.com/ggml-org/llama.cpp/discussions/15396 a guide for running gpt-oss on llama-server, with settings for various amounts of GPU memory, from 8GB on up