by jefft255 919 days ago
A couple million, IIRC. Nothing "large" compared to modern transformer models.

Thanks for getting back to me. That's what I thought. The magic seems to start happening in the low billions of parameters -- and I say "seems" b/c there's no consensus as to whether it's really truly magic! In any case, it's a shame that most of the human brainpower capable of improving SotA AI doesn't have access to large-scale resources.