| > If this were hobbyist awe, none of those benchmarks or red-teaming efforts would exist. Absolutely not true. I cannot express how strongly this is not true, haha. The tech is neat, and plenty of real computer scientists work on it. That doesn't mean it's not wildly misunderstood by others. > Concluding from those failure modes that this is just Clever Hans is not adversarial engineering. I feel like you're maybe misunderstanding what I mean when I refer to Clever Hans. The Clever Hans story is not about the horse. It's about the people. A lot of people -- including his owner-- were legitimately convinced that a horse could do math, because look, literally anyone can ask the horse questions and it answers them correctly. What more proof do you need? It's obvious he can do math. Except of course it's not true lol. Horses are smart critters, but they absolutely cannot do arithmetic no matter how much you train them. The relevant lesson here is it's very easy to convince yourself you saw something you 100% did not see. (It's why magic shows are fun.) |
These things are not horses. How can anyone choose to remain so ignorant in the face of irrefutable evidence that they're wrong?
https://arxiv.org/abs/2507.15855
It's as if a disease like COVID swept through the population, and every human's IQ dropped 10 to 15 points while our machines grew smarter to an even larger degree.