Universal Transformers Need Memory: Depth-State Trade-Offs in Adaptive Recursive (arxiv.org)
1 point by che_shr_cat 49 days ago