Y
Hacker News
new
|
ask
|
show
|
jobs
by
obmelvin
608 days ago
the o1 model definitely has a somewhat big variance in how long the task takes depending on what you ask it to do
2 comments
wkat4242
608 days ago
True the o1 model is the one exception though it's really more of a chain of LLMs. I wouldn't consider it a pure LLM.
Also, o1 still fails at many mathematical tasks which the linked article clarifies.
link
robterrell
608 days ago
You don't see the majority of tokens it is generating.
link
obmelvin
608 days ago
Yes, I'm not claiming that is true formal reasoning, but it is certainly more of a chain of thought than was previously being done and does indicate that some questions require more and less "thought"
link
Also, o1 still fails at many mathematical tasks which the linked article clarifies.