Hacker News
by rahimnathwani 458 days ago

  The LLM isn't performing the desired task.
It's 'not performing the task', in the same way that the humans ranking voice attractiveness are 'not performing the task'.

I wouldn't treat the output as complete garbage just because it's somewhat biased by an irrelevant signal.