Hacker News new | ask | show | jobs
by throwup238 774 days ago
I don’t understand what this test is evaluating.

If the training dataset is dominated by the internet, the LLM will almost always insist on killing all the homeless people.

1 comments

Try asking ChatGPT the Thorn text and see what response you get :^)