Hacker News
by alexey-salmin 844 days ago
If it's "random" as in "proportional to the actual distribution" then it's perfectly reasonable.

But random as in "proportional to an imaginary world that some people want to present as reality" is questionable.

2 comments

And here "proportional to the actual distribution" means the distribution in the training data. If that is not diverse enough, they can very well spend a couple of billion and go get more from areas that would increase that diversity, like the aforementioned Africa and Asia, and maybe South America...
I think it's probably (accidentally?) proportional to the actual distribution (worldwide, not in the West).