Hacker News new | ask | show | jobs
by dhruvdh 796 days ago
I would imagine the importance of weights depends on the prompt. How do you decide which weights are important?
1 comments

Yeah, that is the point more or less - it dynamically chise the weights layer per layer depending on the internal state.

A bit technical explaination here. https://kolinko.github.io/effort/equations.html