Hacker News
by
dietr1ch
614 days ago
I guess that if the bulk of the computation goes into the multiplications, you can work in log space, where each multiplication becomes a plain addition, and convert back to the original space only when you actually need to add values there.
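A rough sketch of that idea, using only Python's standard library. The name `product_via_logs` and the sample values are made up for illustration, and it assumes every value is strictly positive (zeros and negative numbers would need separate handling, e.g. tracking the sign on the side):

```python
import math

def product_via_logs(values):
    """Multiply strictly positive numbers by summing their logarithms."""
    # log(a * b) = log(a) + log(b), so the product becomes a sum in log space.
    log_total = sum(math.log(v) for v in values)
    # Convert back to the original space only once, at the end.
    return math.exp(log_total)

# Hypothetical sample values, chosen only for illustration.
values = [0.5, 4.0, 2.5, 0.2]
direct = math.prod(values)
via_logs = product_via_logs(values)

# The two results should agree up to floating-point rounding.
print(direct, via_logs)
```

Whether this saves anything depends on how often you have to leave log space: the sum is cheap, but every `log` and `exp` conversion is not.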
1 comment
a-loup-e
614 days ago
Not sure how well that would work if you're adding a bias after every layer
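To make the concern concrete, here is a minimal sketch of what adding a bias looks like if the values stay in log space. The helper name `log_add` and the numbers are assumptions for illustration, and it only covers strictly positive `x` and `b`; a negative bias has no real logarithm, so it would need extra sign bookkeeping:

```python
import math

def log_add(log_x, log_b):
    """Return log(x + b) given log(x) and log(b), for positive x and b."""
    hi, lo = max(log_x, log_b), min(log_x, log_b)
    # log(x + b) = hi + log(1 + exp(lo - hi)).
    # Each addition therefore costs one exp and one log1p call.
    return hi + math.log1p(math.exp(lo - hi))

# Hypothetical activation and bias, chosen only for illustration.
x, b = 3.0, 0.25
log_sum = log_add(math.log(x), math.log(b))

# Converting back recovers x + b up to floating-point rounding.
print(math.exp(log_sum))
```

So an addition that was a single operation in the original space turns into an `exp` plus a `log1p` in log space, and paying that after every layer may well cancel out what the cheaper multiplications saved.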