|
|
|
|
|
by Ukv
763 days ago
|
|
> in your argument here you're not talking about a "more compelling evidence of personhood", but rather discriminating towards some specific identity. A preteen non-native speaker is just as much a person, with just as much personhood It's not about how much of a person they are, but how much evidence of personhood responding in a particular way gives. If you were the interrogator and gave some challenge you think only humans can solve, and got back one "asdfghjkl" (no real evidence either way) and one correct answer (evidence in favor of personhood), your beliefs should be adjusted towards believing the latter is the human. Always giving bad answers just because humans can also give a bad answer is already a failing strategy with low success rate when the test is carried out as Turing specified, with no requirement to imitate a specific characteristic added in. As an analogy that may or may not help: You have two boxes, one containing a rabbit and one containing a turtle. One box is perfectly still, offering little evidence either way (rabbits and turtles can both trivially stay still). The other box is bouncing up and down (something you have reason to believe is difficult for turtles). Which box more likely contains the rabbit? > In other words your version of the test doesn't involve solving many "X"s at all. In fact it only requires one - the one which simplifies the domain so much as possible. I think the key you are missing is that it is up against a real human. It does not just have to pass the "well both humans and bots could theoretically respond in this way" mark by giving gibberish answers, but instead get chosen as human when the second player is likely satisfying many of the interrogator's tests. |
|
LLMs are trained on nothing except the corpus of human knowledge. It is literally impossible for them to e.g. accidentally say something that it's inconceivable for a human to say, if we have absolutely no way of constraining the identity of said human. And another way these tests are made easier is by generally having all participants manually type out their interactions, instead of having them transcribed. So the dozens of interactions you'd normally have in 5 minutes goes down to ~5. Making all of this that much more true. It's just silly.
And no, always giving bad answers it not a failing strategy. As I mentioned, the scenario I'm describing is not a hypothetical. The Turing Test (or at least yet another abysmal bastardization of it) was passed in 2014, with the chatbot in question impersonating a 13 year old Ukrainian teen who was not good in English, or maintaining a coherent dialogue, or train of thought, or even being coherent for the most part. I'm sure this is exactly what Turing had in mind. [1]
[1] - https://www.bbc.com/news/technology-27762088