| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by katzenversteher 491 days ago
	I bet a token like "sht!", "f*" or "damn!" would have the same or even stronger effect but the LLM creators would not like to have the users read them

3 comments

raducu 490 days ago

It's literally in the article, they measured it and wait was the best token

link

ascorbic 490 days ago

Maybe, but it doesn't just use it to signify that it's made a mistake. It also uses it in a positive way, such as it's had a lightbulb moment. Of course some people use expletives in the same way, but that would be less common than for mistakes.

link

lodovic 491 days ago

I think you're onto something, however, as the training is done through on text and not actual thoughts, it may take some experimentation to find these stronger words.

link