Y
Hacker News
new
|
ask
|
show
|
jobs
by
polygamous_bat
760 days ago
This is an interesting question. Is there a “controversy-benchmark” perhaps, to measure this?
1 comments
pennomi
759 days ago
In that same light, what about over-alignment benchmarks? Things like LLMs refusing to tell you how to destroy all children of a Unity GameObject.
link