Hacker News
by llamaInSouth 903 days ago
Fast is great but uncensored is better
3 comments

They had a whole section on guided decoding, which can be used, among other things, to break censorship/alignment efforts.
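For anyone unfamiliar: guided decoding roughly means masking the model's next-token scores so that only tokens permitted by some constraint can be chosen. A toy sketch in plain Python, assuming greedy selection; the tokens, scores, and the `guided_greedy_step` helper are all made up for illustration and are not taken from the article or any real library:

```python
import math

def guided_greedy_step(logits, allowed):
    """Pick the highest-scoring token among those the constraint allows."""
    # Tokens outside the allowed set get a score of -inf so they can never win.
    masked = {tok: (score if tok in allowed else -math.inf)
              for tok, score in logits.items()}
    return max(masked, key=masked.get)

# Hypothetical next-token scores from a model (made-up numbers).
logits = {"Sorry": 3.2, "Yes": 2.1, "No": 1.7, "Maybe": 0.4}

# Without a constraint, greedy decoding takes the top-scoring token.
unconstrained = max(logits, key=logits.get)

# With a constraint, only "Yes" or "No" may be emitted at this step.
constrained = guided_greedy_step(logits, allowed={"Yes", "No"})

print(unconstrained, constrained)
```

Real implementations apply the same idea to the full vocabulary at every step, usually with the allowed set derived from a grammar or schema.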
Just run/train it yourself.
For anyone else who wishes to do so, Fireship has a low-barrier video[1] on training the uncensored model in the sibling comment (dolphin mistral) for local use.

[1] https://www.youtube.com/watch?v=GyllRd2E6fg

What do you want to use it for? Out of all the things I use LLMs for, there isn't one situation where I care about political correctness or whatever.
Copilot tells me all the time that it doesn't want to answer my questions on multiple topics, never political and not usually about illegal things (talking about something illegal isn't in itself illegal anyway). So even if you don't care about political correctness, you should care about censorship.

And 2 days ago, I was able to use Microsoft's Copilot (not GitHub's) to generate Python scripts, and now it tells me that it is incapable. Just tried it right now and the functionality came back, though.

They just keep removing features. To me that is pretty much censoring, or whatever you want to call it. The quality degrades even faster than Google search quality.

Here is a very simple example: https://i.imgur.com/vydRUwn.png (I would never ask that question usually, but I can't remember which questions I previously asked because they deleted my history)

I told Bing to stop hallucinating. It got upset, told me to come back when I am ready to talk, and hung up. True story!
Court documents. One mention of rape or murder and there's a 50:50 chance GPT/Claude/Bard will just shut down, when all I want is to extract the entities mentioned in transcripts or opinions.
You probably want to talk to Lexis/Nexis instead if you’re dealing with legal materials. I believe they are working with some RELX affiliates in this space.