| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Gracana 313 days ago
	Have you used any thinking models? I remember being surprised by QwQ-32B when I tried it. It would think about what I said and how it should respond, reiterate the behaviors I had assigned to it, and respond accordingly. That constant self-reinforcement in the thinking phase seemed to keep it on track.

1 comments

I might have tried DeepSeek-r1-8B in the past but I'll certainly try again. Good idea.