by milgrum
281 days ago
How many TPS do you get running GPT OSS 120b on the 395+? I'm considering a Framework Desktop for a similar use case, but I've been reading mixed things about performance (specifically with regard to memory bandwidth, but I'm not sure if that's really the underlying issue).

A 70B dense model is slower.
Qwen Coder 30B Q4 runs at 40+ TPS.
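
On the memory bandwidth question, a common rule of thumb is that single-stream decoding has to read every active weight once per token, so bandwidth divided by bytes read per token gives a rough ceiling on TPS. Here is a minimal sketch of that estimate. The numbers are assumptions, not measurements from this thread: 256 GB/s is the theoretical peak usually quoted for the 395+, and roughly 3B active parameters assumes the Qwen model is the mixture-of-experts A3B variant. Real throughput lands well below the ceiling.

```python
def decode_tps_ceiling(bandwidth_gb_s: float,
                       active_params_billions: float,
                       bits_per_weight: float) -> float:
    """Rough upper bound on decode tokens/s when memory bandwidth is the limit.

    Assumes each generated token reads every active weight exactly once and
    ignores KV cache traffic, compute time, and other overhead.
    """
    # Billions of parameters * bytes per parameter = GB read per token.
    gb_read_per_token = active_params_billions * bits_per_weight / 8
    return bandwidth_gb_s / gb_read_per_token


# Assumed theoretical peak bandwidth for the 395+ (not a measured figure).
ASSUMED_BANDWIDTH_GB_S = 256.0

# Dense 70B at 4 bits per weight: all 70B parameters are read per token.
dense_70b = decode_tps_ceiling(ASSUMED_BANDWIDTH_GB_S, 70.0, 4.0)

# MoE 30B at 4 bits per weight, assuming about 3B parameters active per token.
moe_30b = decode_tps_ceiling(ASSUMED_BANDWIDTH_GB_S, 3.0, 4.0)

print(f"70B dense ceiling: {dense_70b:.1f} tok/s")
print(f"30B MoE ceiling:   {moe_30b:.1f} tok/s")
```

Under these assumptions the dense 70B ceiling is in the single digits while the MoE ceiling is far higher, which fits the pattern above: a dense model pays for every parameter on every token, while an MoE model only pays for the experts it activates.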