|
|
|
|
|
by julianlam
44 days ago
|
|
Interesting that Gemma 4 didn't crack the top 10. I've been experimenting with the 26B-A4B model with some surprisingly good results (both in inference speed and code quality — 15 tok/s, flying along!), vs my last few experiments with Devstral 24B. Not sure whether I can fit that 35B Qwen model everybody's so keen on, on my 32GB unified RAM. However I think I may be in the minority of HN commenters exploring models for local inference. |
|