|
|
|
|
|
by skeledrew
54 days ago
|
|
It'll be a while yet before open models that're good enough will be viable for local use. Heck I've been trying to use the Qwen 3.5 39B A3B on my system, which is modest but no slouch, and have only been able to get ~4.5 tok/s after optimization, and it really runs my system red (fans instantly go crazy). It's just not practical for serious work. |
|
I get about 30 tok/s, which is far from blazing, but given the capability it has it is absolutely viable for accelerating my work.