Hacker News new | ask | show | jobs
Efficient Pre-Training with Token Superposition (nousresearch.com)
2 points by pyinstallwoes 29 days ago