Y
Hacker News
new
|
ask
|
show
|
jobs
by
julianlam
11 days ago
Last time I tried Gemma 4 (26B-A4B) its memory usage would balloon and consume all of my swap until my machine died.
Qwen 3.6 on the other hand barely uses any memory at all for its KV cache.
1 comments
verdverm
11 days ago
Turns out when you block people from the best and biggest hardware, they get innovative. It reminds me of the Pentium days when everyone was shipping inefficient programs because the processor would be better next year.
link
iknowstuff
10 days ago
we never stopped doing that!
link