Hacker News
by hunter2_ 815 days ago
It's not comparing your response to some hard truth; it's comparing your response to a typical response. Sort of like how LLMs dish stuff out based on what's probable, not based on hard truth.

So when you fail, it's not really saying you're wrong; it's saying you're not like most.

1 comment

I'm not helping. I always try to get a few wrong just to screw with their training.