Hacker News
by ugurnot 1201 days ago
I evaluated ChatGPT on the Winogrande Debiased validation set [1], a dataset focused on commonsense reasoning. ChatGPT reached an accuracy of 62.75%, below GPT-3's reported accuracy of 77.7%.

[1] https://github.com/ugorsahin/Winogrande_ChatGPT
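
For anyone who wants to sanity-check a number like this: accuracy on a two-option task is just the share of items where the chosen option matches the gold label. Below is a minimal sketch in Python, assuming predictions and gold answers are stored as "1"/"2" strings. The function name and the sample data are hypothetical and are not taken from the linked repo.

```python
def accuracy(predictions, labels):
    """Return the fraction of predictions that match the gold labels."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("cannot compute accuracy on an empty set")
    # Count exact matches between the predicted option and the gold option.
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


# Hypothetical toy data: four items, each answered with option "1" or "2".
toy_predictions = ["1", "2", "1", "2"]
toy_labels = ["1", "2", "2", "2"]
toy_score = accuracy(toy_predictions, toy_labels)
print(f"{toy_score:.2%}")
```

Items where the model gives no parsable answer would need their own rule (counting them as wrong is one option); this sketch assumes every prediction is already mapped to "1" or "2".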