Hacker News new | ask | show | jobs
by dauertewigkeit 942 days ago
It's part of the TensorLi definition where all the magic happens.
1 comments

That is true. I went for a simple implementation of the layer norm and included it in the tensorli definition. But it would have been better to define it as a moduli for clarity.