

by jackblemming 1158 days ago

I work in vision. Go look up the ImageNet leaderboard. Look at the results of AlexNet vs the top result today. The trend is a log curve. The top contending architectures still include CNNs trained with backprop; they've just had a decade of tricks applied to eke out some improvements. The transformer-based vision models aren't much better.

Talk to any machine learning expert and they'll tell you the math and fundamentals haven't really changed since the 90s; we've just gotten better at scaling. Transformers came onto the scene half a decade ago and we could scale them much better than CNNs, but, like today's CNNs, they have hit the point of diminishing returns.

Maybe look at actual data instead of being dismissive of different opinions.

1 comment

olddustytrail 1158 days ago

Ok, I checked. AlexNet: 63%. Top rated: 91%. That's a big difference.

And what did you expect other than a log curve? The maximum is obviously 100%.


jackblemming 1158 days ago

So interestingly, you can actually have linear or exponential curves on your way from 0 to 100. And you completely ignored how the basic building blocks and algorithms are more or less the same. I think I'm done discussing with non-experts.
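The point about curve shapes can be sketched in a few lines of Python. This is only an illustration: the function names, the 10-step horizon `T`, and the specific formulas are assumptions chosen for the example, not leaderboard data. All three curves start at 0 and end at 100, but they get there very differently.

```python
import math

# Hypothetical horizon: number of "steps" (e.g. years) to go from 0 to 100.
T = 10

def linear(t, horizon=T):
    """Constant progress per step."""
    return 100 * t / horizon

def exponential(t, horizon=T):
    """Slow start, fast finish; normalised so the curve hits 100 at t == horizon."""
    return 100 * (math.exp(t) - 1) / (math.exp(horizon) - 1)

def logarithmic(t, horizon=T):
    """Fast start, diminishing returns; normalised to hit 100 at t == horizon."""
    return 100 * math.log1p(t) / math.log1p(horizon)

if __name__ == "__main__":
    # Print the three curves side by side for each step.
    for t in range(T + 1):
        print(f"t={t:2d}  linear={linear(t):6.2f}  "
              f"exponential={exponential(t):6.2f}  "
              f"logarithmic={logarithmic(t):6.2f}")
```

Halfway through, the exponential curve is still near 0, the linear one is at 50, and the logarithmic one is already well past 50, so a 100% ceiling alone does not force a log-shaped trend.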
