|
|
|
|
|
by NathanWilliams
1214 days ago
|
|
And how will they know when the "useful info" is simply false? Ignore the depressed, aggressive (sorry, "assertive") antics, the fact it can confidently assert false information is the true danger here.
People don't read beyond the headline as it is, they aren't going to check the references (that themselves are sometimes non-existent!) |
|
Having a 'truth' benchmark seems an almost impossible task given the size of the problem space, but it is quite troubling to have statements like "most is useful info", "some info is purely hallucinated", etc, without having any ideas about the numbers, not any confidence indicator (well, 'trust me bro' seems to have been a huge part of the training data). Does anyone have any idea of how true the results might be given certain types of queries?
In my own experience with ChatGPT, I don't think I'm at even 50% of decent answers for my queries. And worse, it's absolutely inconsistent, you might get totally opposite answer one time to the next.