| HN Mirror

	by llm_trw 584 days ago
	The closest explanation for how chain of thought works is that it suppresses the probability of a termination token. People have found that even letting LLMs generate gibberish tokens produces better final outputs, which isn't a surprise once you realise that the only way an LLM can do computation is by outputting tokens.
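
	To make "suppressing the probability of a termination token" concrete, here is a toy sketch, not how any particular model or library does it. The 4-token vocabulary, the logit values, `EOS_ID`, and the `penalty` of 5.0 are all made up for illustration. It only shows that lowering one token's logit before the softmax moves its probability mass onto the other tokens, so generation is more likely to continue.

```python
import math


def softmax(logits):
    """Convert raw logits to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def suppress_token(logits, token_id, penalty):
    """Return a copy of logits with `penalty` subtracted from one token's logit."""
    adjusted = list(logits)
    adjusted[token_id] -= penalty
    return adjusted


# Hypothetical 4-token vocabulary; index 3 plays the role of the
# end-of-sequence (termination) token. All values are illustrative.
EOS_ID = 3
logits = [1.0, 0.5, 0.2, 2.0]

before = softmax(logits)
after = softmax(suppress_token(logits, EOS_ID, penalty=5.0))

print(f"P(EOS) before: {before[EOS_ID]:.3f}")
print(f"P(EOS) after:  {after[EOS_ID]:.3f}")
```

	The non-EOS tokens keep their relative odds; they just absorb the mass the termination token lost, which means more tokens get emitted before the model stops.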