Hacker News new | ask | show | jobs
“Attention”, “Transformers”, in Neural Network “Large Language Models” (bactra.org)
2 points by iflp 1074 days ago