Hacker News new | ask | show | jobs
by ninjagoo 87 days ago
> I’m sorry but you’re demonstrably incorrect.

Please so demonstrate?

2 comments

The onus isn’t on me. It’s on anyone contradicting findings by most benchmarks, because most of them show a clear advantage for Opus and GPT over OSS models.
So Big Claim No Demonstration? :-)
I mean just use them and compare, the gap is obvious.
I did, and I fixed Qwen's issues with trivial sampling and loop detection hacks.

If I can do this, then a company that wants to sell local models seriously could do it too.

> I did, and I fixed Qwen's issues with trivial sampling and loop detection hacks.

Wow, that's amazing! Care to share the changes? Would love to try them out.

It's not amazing at all.

What's amazing is that LLM technologies are so immature that even basic engineering diligence isn't being done. (Like detecting token loops, for example.)