by yugretcx
221 days ago
Why do these text diffusion demos always look like the number of allowed tokens is fixed for a specific unfilled region? Is this the case? I.e., if the region only has four tokens (here, characters) but the model calculates that the best word is “forget”, does it just abandon the best fit or truncate it to fit? Are there text diffusion models with lax infill directives?
So yes, you define a sequence of [MASK] tokens with some length ahead of time.
In practice, if a model wants to write a shorter sequence, it'll just fill the remaining tokens with empty content. If it wants to write a longer sequence, you'll have to identify this and extend the sequence with more [MASK] tokens. This is typically obvious since there's no "end of sequence" token present if the model wants to generate more.
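To make the "extend when there's no end-of-sequence token" idea concrete, here's a rough sketch. Everything in it is made up for illustration: `infill`, `toy_fill`, the `[EOS]`/`[PAD]` token strings, and the `grow_by`/`max_len` parameters are assumptions, and `toy_fill` is a stand-in for a real denoiser that just wants to spell "forget" one character per slot.

```python
from typing import Callable, List

MASK, EOS, PAD = "[MASK]", "[EOS]", "[PAD]"  # hypothetical token strings


def infill(fill: Callable[[List[str]], List[str]],
           init_len: int = 4, grow_by: int = 4, max_len: int = 32) -> List[str]:
    """Fill a masked region, appending more [MASK] slots until [EOS] shows up."""
    seq = [MASK] * init_len
    while True:
        seq = fill(seq)  # the model replaces every [MASK] it sees
        if EOS in seq:
            # The model finished; drop [EOS] and any padding after it.
            return seq[:seq.index(EOS)]
        if len(seq) >= max_len:
            # Budget exhausted: return what we have, even though it is cut off.
            return seq
        # No [EOS] means the model wanted more room, so extend the region.
        seq = seq + [MASK] * min(grow_by, max_len - len(seq))


def toy_fill(seq: List[str]) -> List[str]:
    """Toy stand-in for a denoiser: writes 'forget', then [EOS], then padding."""
    target = list("forget") + [EOS]
    return [
        (target[i] if i < len(target) else PAD) if tok == MASK else tok
        for i, tok in enumerate(seq)
    ]


print("".join(infill(toy_fill)))               # region starts too short, grows once
print("".join(infill(toy_fill, init_len=10)))  # region starts too long, rest is padded
```

In both calls the result is the full word: the four-slot region gets extended because no `[EOS]` appeared, and the ten-slot region is trimmed at `[EOS]`. A real model would re-denoise the whole extended sequence rather than keep the earlier tokens frozen, so treat this as the control flow only.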