|
|
|
|
|
by throwaway4aday
896 days ago
|
|
> Interestingly, Eliza still outperforms ChatGPT in certain Turing test variations. I see we have a new entry for the 2024 Lies of Omission award. The article linked to plainly shows that Eliza only beats ChatpGPT-3.5 and is in the bottom half when ranked against a variety of different system prompts. An excellent ass covering strategy that relies on the reader not checking sources. An honest author would have actually quoted the article saying: > GPT-4 achieved a success rate of 41 percent, second only to actual humans. instead of constructing a deliberately misleading paraphrase. |
|
The blog appears to have been updated to specify GPT3.5, but the original version was accurate.
The paper itself is interesting as it covers the limitations (it has big methodological issues), how the GPT prompts attempted to overridei default chatGPT tone and reasons why ELIZA performed surprisingly well (some thought it was so uncooperative, it must be human!) https://arxiv.org/pdf/2310.20216.pdf