by latexr 617 days ago
> but shows that using CoT prompt does improve llm responses.

A wrong answer is a wrong answer. On one of the questions it failed in exactly the same way that GPT-4o did when I asked, so it’s not at all clear that this is better. I could even see the chain and identify exactly where it made the mistake, but that’s not really a consolation.

2 comments

As I said, it’s not perfect at answering every question correctly. What I am saying is that CoT prompting does have an effect on the quality of LLM responses. Ask t1 and Llama 3.1 how many r’s are in “strawberry”, or a similar question, and you will see that the CoT strategy has some effect.
Also, to be clear: I never claimed that t1 is better than GPT-4o or o1, but thank you for trying it and providing feedback :)
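
For anyone who wants to try the strawberry comparison themselves, here is a rough Python sketch of how it could be set up. It only computes the ground truth, builds a direct prompt and a CoT-style prompt, and checks a reply against the true count. The prompt wording and the function names are my own assumptions (not whatever t1 actually uses), and no model is called; you would paste each prompt into the model you want to compare.

```python
import re


def count_letter(word: str, letter: str) -> int:
    """Ground truth: case-insensitive count of `letter` in `word`."""
    return word.lower().count(letter.lower())


def build_prompts(word: str, letter: str) -> dict[str, str]:
    """Return a direct prompt and a CoT-style prompt for the same question.

    The wording is illustrative only, not taken from any specific model.
    """
    question = f'How many times does the letter "{letter}" appear in "{word}"?'
    return {
        "direct": f"{question} Answer with a single number.",
        "cot": (
            f"{question} Spell the word out one letter at a time, "
            "keep a running count, then give the final number."
        ),
    }


def is_correct(reply: str, word: str, letter: str) -> bool:
    """Treat the last integer in the reply as the model's final answer."""
    numbers = re.findall(r"\d+", reply)
    return bool(numbers) and int(numbers[-1]) == count_letter(word, letter)


if __name__ == "__main__":
    for name, prompt in build_prompts("strawberry", "r").items():
        print(f"[{name}] {prompt}")
```

Running both prompts over a handful of words and tallying `is_correct` per prompt style would give a rough sense of whether the CoT variant helps, though a few questions will not settle it either way.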