Y
Hacker News
new
|
ask
|
show
|
jobs
My $600 Mac Mini Runs a 35B AI Model
(
thoughts.jock.pl
)
4 points
by
danebalia
60 days ago
1 comments
bigyabai
60 days ago
> The 35B Trick (Your SSD Is the New GPU Memory)
Wave "bye bye" to your write cycles.
link
RobMurray
60 days ago
why? it's mostly reads. the weights are static.
link
bigyabai
60 days ago
llama-cpp's process is, but macOS itself will swap hard when 10-14gb of memory is paged for LLM inference. Dense models especially would thrash zram.
link
Wave "bye bye" to your write cycles.