| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ShamelessC 1235 days ago
	Reality has an inherent liberal bias. In all honesty though, the dataset it was trained on may have a liberal bias. This is _precisely_ the sort of bias you should expect from a large language model .............................

2 comments

CSMastermind 1235 days ago

Weren't Reddit posts part of the core data set used to train the model?

That alone probably explains the bias.

link

zapdrive 1234 days ago

Yes. And it probably wouldn't have a bias if reddit wasn't heavily censored, with anyone right leaning being banned. It's practically a left wing propoganda website now.

link

kfrzcode 1235 days ago

What do you mean about liberal bias.... Reality is by it's very nature unbiased. It just...is

link

ShamelessC 1235 days ago

It was a joke. I mean, it's a joke I personally happen to believe is true, but not something I will state as factual.

Somewhere on the political spectrum lies objective facts, truth, and logic. My priors tell me this side tends to be left-of-center. My priors also tell me that the majority of people's political beliefs are decided for them by their parents and their upbringing. So I'm happy to admit that plenty of liberals are in it for the wrong reasons. That doesn't detract from it being the side on the correct side of history.

But again, it was a joke.

link

zapdrive 1234 days ago

I also used to believe that facts and truth were left of center. But after the whole "get vaccinated or you will be killing someone's grandparents" propoganda came out to be false, I have a hard time believing the left.

link

ShamelessC 1234 days ago

Okay.

link

jug 1235 days ago

A large data set will be biased if the sum of data is leaning towards some direction.

I'm not sure you can produce a truly unbiased model without actively interfering with it.

Just consider the fact that you'll find less republicans among scientists. (source: https://www.pewresearch.org/politics/2009/07/09/section-4-sc...)

Now the research-based data on ChatGPT will be biased. It takes no active "inserting" by OpenAI. It may manage creating the bias all by itself.

link