by LightFog 715 days ago
Maybe I’m reading too much into it, but the roadmap's mention of switching from GPT-4 Turbo to GPT-4o and hoping for better math performance feels like they are betting on a significant near-term reliability improvement in LLMs without any other real plans. That magic jump looks more doubtful by the day.

Also pretty important that they're using a calculator for numerical problems. That should avoid some of the most embarrassing mistakes.
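
For anyone wondering what "using a calculator" could look like in practice, here is a rough sketch, not the project's actual code. The assumption is that the model emits a plain arithmetic expression as text and a deterministic evaluator computes the result instead of the model guessing digits. The `calculate` function and the operator tables are hypothetical names made up for this illustration.

```python
import ast
import operator

# Hypothetical calculator tool: these names are illustrative only and are
# not taken from the project discussed in this thread.
_BINARY_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}
_UNARY_OPS = {ast.UAdd: operator.pos, ast.USub: operator.neg}


def calculate(expression):
    """Evaluate a plain arithmetic expression without calling eval()."""

    def walk(node):
        # Numeric literals only; booleans are rejected explicitly.
        if (
            isinstance(node, ast.Constant)
            and isinstance(node.value, (int, float))
            and not isinstance(node.value, bool)
        ):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _BINARY_OPS:
            return _BINARY_OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
            return _UNARY_OPS[type(node.op)](walk(node.operand))
        # Names, calls, attribute access and everything else are refused.
        raise ValueError("unsupported expression")

    return walk(ast.parse(expression, mode="eval").body)


if __name__ == "__main__":
    # Exact integer arithmetic, the kind of thing LLMs tend to fumble.
    print(calculate("123456789 * 987654321"))
```

Walking the parsed syntax tree and whitelisting operators means the tool only ever does arithmetic, so model output like a function call gets rejected rather than executed.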