Hacker News new | ask | show | jobs
by SirSegWit 874 days ago
Not really, right? There are a ton of special tokens, like start of sequence etc., so what happens if there are two start of sequences predicted? It's a valid token but cannot really be turned into something sensible, so it throws an error when converting tokens to plain text?
1 comments

Special tokens are handled by the application, not the model. They are still output before then.