Hacker News
by Beijinger 21 hours ago
Does it work for porn collections too?

You'll need a LoRA for this; porn content rejection is heavy. Or you'll need an abliterated model, though I'm not sure vision still works there.

You might want to add something like a YOLO fine-tune to detect scenes, plus face recognition too.

For GP's purpose, can face recognition techniques be repurposed for, um, recognizing other body parts? Sometimes the actresses are facing away from the camera. There are exposed lips, if that helps.
Yes, for actresses _and_ actors I'm sure you'd get the same level of performance as you would for any facial recognition use case. You can't do facial recognition on someone's back, but I'm sure there are other techniques/models that can be applied; many people have unique marks/features, etc.
Vision still works perfectly fine in abliterated models.
Just because they don't refuse it doesn't mean they are useful.

I found a few pornographic pictures on the web to hand to Abliterated Gemma4 12B (literally just to test this), and it needs pushing just to accept that people can be naked.

It didn't refuse but it also didn't provide useful descriptions such as "this is a pornographic picture of a woman".

> G4: There is a person lying down in a scientific context, if I had to guess they are a biologist in a classroom

> me: Is she wearing any clothes?

> G4: No.

Also, it is obsessed with penises, seeing them in compositions where there is only a female. I suppose it's been trained to ban dick pics or something.

Prompting may help some, but with the vision/audio model 12B seems to be a bit worse than E4B at voice and text reading, so maybe E4B would do better.

Never tried any of this for porn, just thinking out loud about how I would go about it tbh!
Asking the important questions
I was meandering through the comments about to leave the topic when my interest suddenly piqued upon reading the word porn.
Why is it always the same question? Hahah. I posted my project on Reddit and I got the same one hahah
Ha ha ha, it's because most humans overlap on a few things - like eating, shitting, sleeping and fucking, ha ha ha.
Last time I tried Whisper, it hallucinated an elaborate conversation from sounds of slapping and moaning, and it took minutes to spit out every single line of it.
Parakeet has been trained to detect non-voice sounds and exclude that from identification, so you might have better luck with that family.
If I remember correctly, the Whisper documentation actually recommends trimming non-speech portions, as the models hallucinate heavily during those portions.
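For what it's worth, here is a rough sketch of the trimming idea, assuming mono audio already decoded to floats in [-1, 1]. The function names, the 30 ms frame size and the 0.02 threshold are all made up for illustration. Note that this is only an energy gate: it drops quiet stretches, but it cannot tell speech from loud non-speech sounds, so a real voice activity detector would be needed for that.

```python
import math


def frame_rms(samples, frame_len):
    """Return the RMS energy of each non-overlapping frame."""
    rms = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        rms.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    return rms


def loud_segments(samples, sample_rate, frame_ms=30, threshold=0.02, min_frames=3):
    """Return (start_sec, end_sec) spans whose frame RMS stays at or above threshold.

    threshold and min_frames are illustrative defaults, not tuned values.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    energies = frame_rms(samples, frame_len)
    segments = []
    run_start = None
    # The -inf sentinel closes a run that reaches the end of the audio.
    for i, energy in enumerate(energies + [float("-inf")]):
        if energy >= threshold:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            # Keep only runs long enough to be worth transcribing.
            if i - run_start >= min_frames:
                segments.append((run_start * frame_len / sample_rate,
                                 i * frame_len / sample_rate))
            run_start = None
    return segments
```

You would then cut the audio at the returned spans and hand only those pieces to the transcriber, instead of the whole file.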
Not sure if you’re being sarcastic but I think this is an interesting question. Would DeepSeek be useful here since it is local?
Just because it is local does not mean it wouldn't reject explicit content. You can definitely try to find abliterated models, or attempt to use Unsloth or something similar to tune it properly.
Is abliteration even necessary? While “playing around” I have noticed that most models are very strict only in the first prompt. The moment you get past that with a good turn, from the next turn on you can get them to do _anything_.
Depends how deep you wanna go.