Y
Hacker News
new
|
ask
|
show
|
jobs
by
leumon
128 days ago
My locally running nemotron-3-nano quantized to Q4_K_M gets this right. (although it used 20k thought tokens before answering the question)