Hacker News new | ask | show | jobs
by _false 265 days ago
Completely agree in principle, I'd expect this when minimizing entropy over any text incl. code. However, evals across variety of domains show that LLMs can reach (and even surpass) expert performance[^1].

[1]: https://arxiv.org/abs/2508.17669