Hacker News new | ask | show | jobs
by viccis 148 days ago
By what measure? What's "safe"?
1 comments

https://crfm.stanford.edu/helm/air-bench/latest/#/leaderboar...

This isn’t the gotcha question you think it is. AI safety is being defined and measured.

Cool, another metric to game like they do the other ones.