| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by ytpete 92 days ago

- When GPT 4 was asked to evaluate resume executive summaries, it preferred ones written by GPT over human-written ones > 93% of the time.

- Similar "bias" was exhibited by other models including LLaMA 3.3 and Deepseek v3.

- Even when human annotators judged the human-written summary to be higher quality, leading LLMs still preferred their own writing 67-82% of the time.

- Preference was stronger in larger models.

- In several cases, LLMs also prefer their own writing over that of other LLMs.

There's a pretty decent longer summary in this thread where I first heard about the article: https://x.com/heynavtoor/status/2048088874686300431