Hacker News
by tessierashpool9 168 days ago
I mean, isn't that a little ridiculous? Aren't those language models already solving complicated exam questions and mathematical problems?

According to the creators, the models are at a PhD level of intelligence, but they can’t get the simplest things right.
Overselling is only the tip of the iceberg. The real problem is that a lot of managers base their decision to introduce language models into business processes on cutting-edge Pro edition demos, but what actually ends up in production is, of course, some cheap Nano/Flash/Mini version.
Too easy.