Hacker News new | ask | show | jobs
D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning (dllm-reasoning.github.io)
4 points by t55 405 days ago