Hacker News
Mechanics of Next Token Prediction with Self-Attention (arxiv.org)
1 point by convexstrictly 819 days ago