by minihat 1022 days ago
My mental model of gpt-4 is apparently well calibrated for whether the model will give me a useful output that is close to what I asked for.

However, I'm not great at predicting whether the model will output a 100% correct response with no flaws whatsoever.

Unfortunately, this website mostly tests for the latter.

2 comments

To me it is fascinating how, when people are not very good at something, they often invent some secondary “true” or “better” task that they are actually good at.
The website specifies its criteria for accepting an answer. Just use that threshold instead of whatever you deem “useful” in your own mind.