by dgellow 123 days ago
Could you share that study?

https://arxiv.org/abs/2512.13914

This is one of many with similar results. It reports a 39% drop in performance.

https://arxiv.org/abs/2506.18403

This one reports 60-80% after multiple turns.