Hacker News new | ask | show | jobs
by gok 117 days ago
Do you use fully bidirectional attention or is it at all causal?