Hacker News new | ask | show | jobs
by austinjp 1478 days ago
> asking [neural networks] to explain themselves using their own capabilities

Exactly. This could be profound. I'm looking forward to further work here. Sure, the examples here are daft, but developing this approach could be like understanding a talking lion [0] only this time it's a lion of our making.

[0] https://tzal.org/understanding-the-lion-the-in-joke-of-psych...

1 comments

I think it’s more likely we can train two neural networks, one to make the decision and one to take the same inputs (or the same inputs plus the output from the first one) and generate plausible language to explain the first. This seems to correspond to what we dimwits consciousness and frankly I would doubt one system can accurately explain its own mechanism. People surely can’t.