Hacker News new | ask | show | jobs
by guilhas 1383 days ago
That is in line with what I said

Most publicly available sources of sexual content are adult, legal, don't have children

So if you feed Pornhub to your model, which most porn is actually well tagged, as is many ML data, you really have to go out of your way to get any children

Not saying is not possible, I am saying is easy to avoid. Not as an impossibility to separate both as you suggested above

1 comments

Nope. You missed the point. I said the opposite.

If your training data have any absolutely innocent children pictures at all and also NSFW content with adults then model will be able to generate NSFW content with children. It's that simple.

Also even if you remove any children completely from training model will still be able to generate CSAM-like content because a lot of legit porn include jailbait models.