Hacker News
by lyfeninja 13 days ago
Below is the "Attention Is All You Need" paper. Transformers and their attention mechanism were the major breakthrough behind modern LLMs. ML has been around for a long time, so I'd suggest joining Kaggle or something similar and learning by doing. You'll retain more and realize how broad the category is these days.

https://arxiv.org/abs/1706.03762