Hacker News
by idonotknowwhy 41 days ago
Then why do the original Command-R, Command-R+, and WizardLM2-8x22B (taken down because Microsoft forgot to run safety checks) get it right every time, while the newer models get it wrong?

I’m not saying it’s a “political conspiracy”; it’s the alignment tax.