Y
Hacker News
new
|
ask
|
show
|
jobs
D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning
(
dllm-reasoning.github.io
)
4 points
by
t55
405 days ago