Hacker News new | ask | show | jobs
by smus 307 days ago
Can you explain how CoT is a form of diffusion or models bidirectional attn?