And again, people think they’ve fooled a generative model by forcing it to do calculations in English when allowing it to write code is way more reasonable and accurate. That this is “news” or interesting is laughable.
Well, it is arguable that it is "laughable" that there is a strong push to try and regulate/contain these systems to avoid human extinction when they can't tell you the number of weekdays in a month (and fail in this manner).
I'm no AI skeptic (and am aware the above does not seem to apply to GPT4), but the general point stands - finding these limitations is an interesting endeavor, and when they're particularly severe or unexpected, newsworthy.
If LLMs cause most 'well ackshually...' replies to be aimed at LLMs instead of humans, I think it's probably a small win for humanity. Let them have their smug satisfaction at outwitting some python code.
I'm no AI skeptic (and am aware the above does not seem to apply to GPT4), but the general point stands - finding these limitations is an interesting endeavor, and when they're particularly severe or unexpected, newsworthy.