Hacker News new | ask | show | jobs
by polygamous_bat 760 days ago
This is an interesting question. Is there a “controversy-benchmark” perhaps, to measure this?
1 comments

In that same light, what about over-alignment benchmarks? Things like LLMs refusing to tell you how to destroy all children of a Unity GameObject.