| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by rando_dfad 991 days ago

Seeing some comments which align with but misunderstand what I suspect the point is.

Generative AI outputs the median of the probability distribution.

Yes, that's an issue with the training data (as people comment), but it is more than that. The training data will have some observations away from the median, but it will still only have one median.

To take my observation to the extreme (where it will fail), if the data is 49.999% X and 50.001% Y, then Y is the most likely output and you'll only every see Y from generative AI.

[yes, I already said that extreme was wrong because the model samples around the median not exactly the median]

1 comments

haykmartiros 991 days ago

This is flat out not true. Generative AI is meant to approximate the distribution. It’s biased from the data, yes, but if it was just the median it would only spit out one face, one gender, etc.

link