Hacker News
by tanananinena 618 days ago
This is probably the most interesting (and insightful) paper on grokking I’ve read recently: https://arxiv.org/abs/2402.15555