Hacker News new | ask | show | jobs
by discordance 900 days ago
Confirmed. Currently running Mixtral 8x7B gguf (Q8_0) on a Macbook Pro M1 Max w 64GB ram, and RAM usage is sitting at 48.8 GB.
1 comments

How many t/s?
Around 15 - 20 t/s
Thank you, got the same build M1 Max during Christmas B&H sale and can confirm it's amazing for running local LLMs.