by bossyTeacher 486 days ago
> What matters is whether the reasoning reaches the right conclusions

No, it doesn't. A broken clock is right twice a day; reasoning is about the journey more than the destination.

1 comment

RL has more than two steps...
Point is that reasoning is about more than the conclusions. If your steps are wrong, your reasoning is wrong regardless of the conclusion. Poor reasoning is what could make an LLM conclude that 1 + 2 = 3 but that 2 + 1 = [some number other than 3].