Hacker News new | ask | show | jobs
by want2takearide 1521 days ago
That's why you don't use cosine activation and always limit yourself to Lipschitz functions, I guess?