For GP's purpose, can face recognition techniques be repurposed for, um, other body parts recognition? Sometimes the actresses are facing away from camera. There are exposed lips, if that helps.
Yes, for actresses _and_ actors I'm sure you'd get the same level of performance as you would for any facial recognition use case. You can't do facial recognition on someone's back, but I'm sure there are other techniques/models that can be applied, many people have unique marks/features etc.
Just because they don't refuse it doesn't mean they are useful.
I found a few pornographic pictures on the web to hand to Abliterated Gemma4 12B(literally just to test this) and it needs pushing just to accept that people can be naked.
It didn't refuse but it also didn't provide useful descriptions such as "this is a pornographic picture of a woman".
> G4: There is a person lying down in a scientific context, if I had to guess they are a biologist in a classroom
> me: Is she wearing any clothes?
> G4: No.
Also, it is obsessed with penises —seeing them in compositions where there is only a female. I suppose it's been trained to ban dick pics or something.
Prompting may help some but 12B seems to be a bit worse than E4B with the vision/audio model at voice and text reading so maybe that one would do better.
Last time I tried whisper, it hallucinated an elaborate conversation from sounds of slapping and moaning and it took minutes to spit every single line of it.
If I remember correctly, the whisper documentation actually recommends to trim non-speech portions as the models halucinate heavily during those portions.
just because it is local does not mean it wouldn't reject explicit content. you can definitely try and find abilated models and can attempt to use unsloth or something similar to tune it properly.
Is abliteration even necessary. While “playing around” I have noticed that most models are very strict only in the first prompt. The moment you get past that with a good turn, the next turn on you can get them to do _anything_.
You might want to add something like yolo finetune to detect scenes + face recognition too.