Hacker News new | ask | show | jobs
by 0-_-0 54 days ago
It's the cumulative weighting based on the softmax output? Is it per layer?
1 comments

No it's not based on softmax output. It's single pass for now !