Hacker News new | ask | show | jobs
by Gracana 313 days ago
Have you used any thinking models? I remember being surprised by QwQ-32B when I tried it. It would think about what I said and how it should respond, reiterate the behaviors I had assigned to it, and respond accordingly. That constant self-reinforcement in the thinking phase seemed to keep it on track.
1 comments

I might have tried DeepSeek-r1-8B in the past but I'll certainly try again. Good idea.