| HN Mirror



	by mountainriver 982 days ago
	Just because it fails one test in a particular way doesn’t mean it lacks reasoning entirely. It clearly does have reasoning, based on all the benchmarks it passes. You are really trying to make it not have reasoning for your own benefit.

1 comments

graynk 982 days ago

> You are really trying to make it not have reasoning for your own benefit

This whole thread really seems like it's the other way around. It's still very easy to make ChatGPT spit out obviously wrong answers depending on the prompt. If it had an actual ability to reason, as opposed to just generating a continuation of your prompt, the quality of the prompt wouldn't matter as much.


mountainriver 976 days ago

Then why does it do so well on all the reasoning benchmarks?
