|
|
|
|
|
by COAGULOPATH
641 days ago
|
|
Yes, this only helps multi-step reasoning. The model still has problems with general knowledge and deep facts. There's no way you can "reason" a correct answer to "list the tracklisting of some obscure 1991 demo by a band not on Wikipedia." You either know or you don't. I usually test new models with questions like "what are the levels in [semi-famous PC game from the 90s]?" The release version of GPT-4 could get about 75% correct. o1-preview gets about half correct. o1-mini gets 0% correct. Fair enough. The GPT-4 line aren't meant to be search engines or encyclopedias. This is still a useful update though. |
|
You're using a calculator as a search engine.