Hacker News
by doku 309 days ago
Did the original paper show that the toy model was fully grokked?