Hacker News new | ask | show | jobs
by trott 488 days ago
Another way to look at this is that there are 12,290 bits of information in choosing 817 samples from 10,000,000.
1 comments

And much more information when selecting just as many examples from quadrillions of randomly generated examples.

The information from the selection criteria isn't available to the model, just the chosen samples.