Hacker News new | ask | show | jobs
by samstave 1062 days ago
This implies that any RLHF is introducing human bias into any "thoughts" the model may have?
1 comments

Yes, but I think your comment has the foundational misconception that it's the first or even main place where bias is put into models.

LLMs are just pattern identifiers and repeaters. They are trained on inherently biased training datasets of inherently biased text written by inherently biased humans. Every single step of training introduces some amount of bias to an LLM.