You will see that tokens not predicted by greedy sampling of the target model are rejected. Ergo, they are mathematically identical.
You will see that tokens not predicted by greedy sampling of the target model are rejected. Ergo, they are mathematically identical.