Hacker News new | ask | show | jobs
by thaumasiotes 1016 days ago
> Not sure how you can assume there was no underlying improvement, and these are cases of feeding it the answers.

Compare

> And it's only fixed for the stated case, but if you reverse the genders, GPT-4 gets it wrong.