Y
Hacker News
new
|
ask
|
show
|
jobs
by
ironbound
152 days ago
Other teams did a better job and provided code
https://github.com/kuleshov-group/bd3lms
1 comments
E-Reverance
152 days ago
This is unrelated. They both use the word "block", but what they are referring to differs
link
ironbound
151 days ago
"Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models"
link
E-Reverance
151 days ago
Yes and? The paper I linked is about network weights, not the type of generative model
link