Hacker News new | ask | show | jobs
Length-Induced Embedding Collapse in Transformer-Based Models (arxiv.org)
3 points by Wheatman 593 days ago