| HN Mirror



	by avereveard 841 days ago
	So far GPT is the only one able to answer variations of these prompts: https://www.lesswrong.com/posts/EHbJ69JDs4suovpLw/testing-pa... It might be trained on these, but you can still create variations and get decent responses. Most other models fail on basic stuff like the Python-creator-on-Stack-Overflow question: they identify Guido as the Python creator, so the knowledge is there, but they don't make the connection.

1 comments

staticman2 841 days ago

>>So far GPT is the only one able to answer variations of these prompts

You're saying that when Mistral Large launched last week you tested it on (among other things) explaining jokes?


avereveard 841 days ago

Sorry, I did what? When?


staticman2 841 days ago

You linked to a LessWrong post with prompts asking the AI to explain jokes (among other tasks?) and said only OpenAI models can do it, didn't you? I'm confused about why you said only OpenAI models can do it.


avereveard 841 days ago

Ah, sorry if it wasn't clear: below the jokes there are a few inference prompts, and so far, yeah, I haven't seen Claude or the others reason the same way as PaLM or GPT-4 (GPT-3.5 did get some wrong). I haven't had time to test Mistral Large yet, though. Mixtral didn't get them right.
