Hacker News new | ask | show | jobs
by WithinReason 86 days ago
Some transformers have a block recurrent structure, here is a paper that made a similar observation recently:

https://www.alphaxiv.org/abs/2512.19941