Hacker News new | ask | show | jobs
by ShamelessC 1235 days ago
Reality has an inherent liberal bias.

In all honesty though, the dataset it was trained on may have a liberal bias. This is _precisely_ the sort of bias you should expect from a large language model .............................

2 comments

Weren't Reddit posts part of the core data set used to train the model?

That alone probably explains the bias.

Yes. And it probably wouldn't have a bias if reddit wasn't heavily censored, with anyone right leaning being banned. It's practically a left wing propoganda website now.
What do you mean about liberal bias.... Reality is by it's very nature unbiased. It just...is
It was a joke. I mean, it's a joke I personally happen to believe is true, but not something I will state as factual.

Somewhere on the political spectrum lies objective facts, truth, and logic. My priors tell me this side tends to be left-of-center. My priors also tell me that the majority of people's political beliefs are decided for them by their parents and their upbringing. So I'm happy to admit that plenty of liberals are in it for the wrong reasons. That doesn't detract from it being the side on the correct side of history.

But again, it was a joke.

I also used to believe that facts and truth were left of center. But after the whole "get vaccinated or you will be killing someone's grandparents" propoganda came out to be false, I have a hard time believing the left.
Okay.
A large data set will be biased if the sum of data is leaning towards some direction.

I'm not sure you can produce a truly unbiased model without actively interfering with it.

Just consider the fact that you'll find less republicans among scientists. (source: https://www.pewresearch.org/politics/2009/07/09/section-4-sc...)

Now the research-based data on ChatGPT will be biased. It takes no active "inserting" by OpenAI. It may manage creating the bias all by itself.