Hacker News new | ask | show | jobs
by circuit10 1166 days ago
Probably because the parameter count is way lower so it's less able to memorize things