Hacker News new | ask | show | jobs
by E-Reverance 152 days ago
This is unrelated. They both use the word "block", but what they are referring to differs
1 comments

"Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models"
Yes and? The paper I linked is about network weights, not the type of generative model