HN Mirror



	by borzunov 1048 days ago
	Hi, a Petals dev here. </s> means "end of sequence" for LLMs. Once a model generates it, it effectively forgets everything that came before and continues with unrelated random text (I'm sorry to hear that the model generated disturbing text in this case). Still, I doubt that malicious actors are involved here. Apparently, the Colab code snippet is just too simplified and does not handle </s> correctly. This is not the case with the full chatbot app at https://chat.petals.dev, so you can try that instead.
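
For illustration, here is a minimal sketch of what "handling </s> correctly" could look like in a generation loop: stop as soon as the end-of-sequence token appears, or cut it off afterwards. This is not the Petals or Colab code. `next_token` is a hypothetical stand-in for a model call, and `EOS_ID = 2` is an assumption (the actual id of </s> depends on the tokenizer).

```python
from typing import Callable, List

EOS_ID = 2  # assumption: id of "</s>"; check your tokenizer, it varies by model


def generate_until_eos(
    next_token: Callable[[List[int]], int],  # hypothetical model call: context -> next token id
    prompt_ids: List[int],
    max_new_tokens: int = 64,
    eos_id: int = EOS_ID,
) -> List[int]:
    """Generate token by token and stop as soon as EOS is produced."""
    context = list(prompt_ids)
    new_tokens: List[int] = []
    for _ in range(max_new_tokens):
        tok = next_token(context)
        if tok == eos_id:
            # Anything generated past this point would be unrelated text.
            break
        context.append(tok)
        new_tokens.append(tok)
    return new_tokens


def truncate_at_eos(token_ids: List[int], eos_id: int = EOS_ID) -> List[int]:
    """Drop the first EOS and everything after it from an already generated sequence."""
    if eos_id in token_ids:
        return token_ids[: token_ids.index(eos_id)]
    return list(token_ids)
```

The first function fits a hand-written sampling loop; the second is a post-processing fallback when the whole sequence has already been generated in one call.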

1 comment

Thanks for the reply. Would one way to guard against that be for the LLM architecture to refuse to serve a request that consists of just <s> as a token?