Hacker News new | ask | show | jobs
by ironbound 145 days ago
"Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models"
1 comments

Yes and? The paper I linked is about network weights, not the type of generative model