| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by minihat 1022 days ago

My mental model of gpt-4 is apparently well calibrated for whether the model will give me a useful output that is close to what I asked for.

However, I'm not great at predicting whether the model will output a 100% correct response with no flaws whatsoever.

Unfortunately, this website mostly tests for the latter.

2 comments

whimsicalism 1022 days ago

To me it is fascinating how when people are not super good at something, they often invent some secondary “true”/“better” task that they were actually good at

link

FabHK 1021 days ago

The website specifies its criteria for accepting an answer. Just use that threshold instead of whatever you in your mind deem “useful”.

link