Hacker News new | ask | show | jobs
by unshavedyak 1036 days ago
So is there some system that implements text/image classification for automatic law enforcement submission that you're referring to? Also jurisdiction, etcetc.

It all seems incredibly complex. Not a reason to "not try", but i suspect we'll struggle to implement even the most basic thing. And even then, take that basic thing and apply it to every software where users can input data.

Plus we'd have to convince everyone to do this. Automatic scanning and submission of data is not a well liked topic. Remember how Apple doing basic CSAM scanning was full of panic?

Even if a government _forced_ us to do this, jurisdiction alone would be a big question. Some serious questions that need some serious thought, imo. Is being hand-wavy even worth the time?

1 comments

you just need you search text, the prompt, for things normal people wouldn’t search for, not at all complex. nobody normal asks AI to create child pornography
You picked the easiest (though still not easy imo) thing, and ignored the rest of the complexity haha.

If it's so easy i'd love to see your implementation that works multi-language, across all media types, for all jurisdictions and hell handles burden all the massive number of edge cases.

Or frankly, any implementation. Whatever you think is easy and everyone should be doing - please point to an E2E implementation of it. Maybe i misunderstand your scope. Something where if a user submits CSAM, or does something to some country authority..?

Prompts use text, period. We already have the technology to search text for child pornography, both google and duck duck go use this.

there’s also negative prompting, which tells the model don’t do these things.

hallucinations can still happen but it’s much easier than you’re making it out to be.

Yea, identifying isn't the hard part though - that's not what i'm concerned about. Automatic submission to parties, APIs for governments, jurisdiction, etc. Wonder what Google does if you type in CSAM triggers.

I suspect we could use some sort of central management service for "Internet Reports". Ie to deal with jurisdictions, reporting something to the right people, etc, as well as the complexities involved with identifying people.

Either way i think you're underselling the complexity. Or maybe you think it's so easy but no one cares, /shrug. Seems a long list of questions i'd have before i could even begin to implement it.