Hacker News new | ask | show | jobs
by leumon 128 days ago
My locally running nemotron-3-nano quantized to Q4_K_M gets this right. (although it used 20k thought tokens before answering the question)