	by latexr 617 days ago
	> but shows that using CoT prompt does improve llm responses.

	A wrong answer is a wrong answer. In one of the questions it failed exactly in the same manner that GPT-4o did when I asked, so it’s not clear at all this is better. I could even see the chain and identify exactly where it made the mistake, but that’s not really a consolation.

2 comments

hayden_k 617 days ago

As I said - it’s not perfect at answering every question right. What I am saying is that CoT prompting does have an effect on the quality of LLM responses. Ask t1 and Llama 3.1 how many r’s are in “strawberry”, or a similar question, and you will see that the CoT strategy has some effect.

hayden_k 617 days ago

Also, to be clear - I never claimed that t1 is better than GPT-4o or o1, but thank you for trying it and providing feedback :)
