by acqq
916 days ago
Reading the motivation for the second: "Relying on autoregressive forward passes to generate text is slow and prone to hallucination/repetition. From the nougat paper: We observed [repetition] in 1.5% of pages in the test set, but the frequency increases for out-of-domain documents. In my anecdotal testing, repetitions happen on 5%+ of out-of-domain (non-arXiv) pages." When these outputs are fed into the next levels as inputs, isn't it only to be expected that you get even more hallucinations/repetitions?