|
|
|
|
|
by seanhunter
1199 days ago
|
|
The part which isn't falsifiable (as far as I can see) is whether the model actually has opinions at all as opposed to producing outputs that match or simulate an opinion. That's partly the point of the Chinese room idea - that you can't prove just by looking at the output one way or the other. The things they did to restrict the model don't demonstrate that it would otherwise actually have an opinion though. They just mean that it's being (arguably artificially) prevented from generating certain texts that appear to endorse a certain point of view. A similar example which demonstrates my point while perhaps being a little more clear cut is the Amazon recruiting AI that got shut down because it was unintentionally amplifying bias present in its training set.[1] I don't think we can assume from that that the model actually had opinions which were misogynistic even though it was producing results which were. [1] https://www.reuters.com/article/us-amazon-com-jobs-automatio... |
|
I think this distinction is neither useful nor interesting.