Hacker News new | ask | show | jobs
by lelandbatey 15 days ago
Something is definitely going wrong with your Qwen setup, in the link you posted it starts and ends with a compaction step due to a 4k token context limit. Qwen 35b supports I think up to 200k+ context limit (though I run only with 128k), that seems to be a major source of the problem.
1 comments

Good call, I need to check if LM Studio is misconfigured.