by pennomi 759 days ago
In that same light, what about over-alignment benchmarks? Things like LLMs refusing to tell you how to destroy all children of a Unity GameObject.