Hacker News
by quotemstr 811 days ago
I wonder whether we're missing out on techniques that work well on large models but don't show promise on small ones.
1 comment

More like we're missing out on techniques, full stop. Proving anything at scale is GPU-expensive, which gatekeeps publication and, in turn, accessibility.