Hacker News new | ask | show | jobs
by Jordan-117 630 days ago
It's the RLHF training to make them squeaky clean and preternaturally helpful. Pretty sure without those filters and with the right fine-tuning you could have it reliably clone any writing style.
2 comments

One only need to go to the dirtier corners of the llm forums to find some _very_ interesting voices there.

To quote someone from a tor bb board: my chat history is illegal in 142 countries and carries the death penalty in 9.

But without the RLHF aren’t they less useful “products”?