Hacker News new | ask | show | jobs
by bwv848 56 days ago
I've been trying the Q4_K_M version, and sometimes it gets stuck in a loop. Gemma 4 doesn’t have this issue.
3 comments

This has happened before with quantizations and other backends (ones not used by the research lab). Give it a week, download latest versions of everything, and try again.
I'm having the same issues, the more I use it. The repetition penalty doesn't seem to help.

I get some really amusing 'reflective' responses, but I think it needs a bit more cooking. Maybe I'll try another variant.

perhaps increasing repitition_penalty might be helpful