Hacker News
by avereveard 841 days ago
So far GPT is the only one able to answer variations of these prompts: https://www.lesswrong.com/posts/EHbJ69JDs4suovpLw/testing-pa... It might be trained on these, but you can still create variations and get decent responses.

Most other models fail on basic stuff like the "Python creator on Stack Overflow" question: they identify Guido as the Python creator, so the knowledge is there, but they don't make the connection.

1 comment

>>So far gpt is the only one able to answer to variations of these prompts

You're saying that when Mistral Large launched last week you tested it on (among other things) explaining jokes?

Sorry, I did what? When?
You linked to a LessWrong post with prompts asking the AI to explain jokes (among other tasks?) and said only OpenAI models can do it, didn't you? I'm confused about why you said only OpenAI models can do it.
Ah, sorry if it wasn't clear. Below the jokes there are a few inference prompts, and so far, yeah, I haven't seen Claude or others reason the same way as PaLM or GPT-4 (GPT-3.5 did get some wrong). I haven't had time to test Mistral Large yet, though. Mixtral didn't get them right.