Y
Hacker News
new
|
ask
|
show
|
jobs
by
singularity2001
601 days ago
maybe this signal needs to be learned in the final step of reinforcement learning where people decide whether "I don't know" is the right answer