by js8
985 days ago
> We can't say NN aren't doing something similar. I agree, but my point was different. We can't say what it does theoretically, therefore we don't know how reliable it is (we don't understand the tradeoffs and failure modes). At least most humans have a way to assess their own reliability, and can see where their reasoning (or that of their fellow humans) is inconsistent.

It comes down to the physical world.
Humans can "assess their own reliability" when they can all point at something in the real world and come to some agreement on what they are all seeing, what to call it, etc.
When humans get off base about something tangible, like an apple, they can all point at the apple and bring themselves back into alignment: that is an apple.
But for abstract concepts in philosophy, morals, etc., things that are not tangible, humans can 'drift' just as much as AI.
Humans can get into echo chambers -> and 'go nutz', absorbing others' misinformation.
LLMs learning from other LLMs -> the models 'drift' over time.
https://news.ycombinator.com/item?id=37811610