by singularity2001 601 days ago
Maybe this signal needs to be learned in the final step of reinforcement learning, where people decide whether "I don't know" is the right answer.