Y
Hacker News
new
|
ask
|
show
|
jobs
by
simlevesque
216 days ago
gpt-oss:120b
https://til.simonwillison.net/llms/codex-spark-gpt-oss
1 comments
hamdingers
216 days ago
Am I missing it or is there no information about performance? Looking for a tokens/sec
link
aseipp
216 days ago
Right now I get 59 tok/sec on GPT-OSS 120B using Unsloth's dynamic 4-bit quants, via llama.cpp
https://news.ycombinator.com/item?id=45881049
link
simlevesque
216 days ago
He didn't give that info but the transcript linked at the end shows how much time was spent for each query.
link