Y
Hacker News
new
|
ask
|
show
|
jobs
by
trashcan2137
79 days ago
and the EOS is "<turn|>". "<|channel>thought\n" is also used for the thinking trace!
Can someone explain this to me? Why is this faux-XML important here?
2 comments
pertymcpert
78 days ago
That’s how the model is trained to signal the end to its generation and to indicate its thinking.
link
sroussey
78 days ago
These are likely individual tokens. They are super common.
link