Hacker News new | ask | show | jobs
by gigafuture 925 days ago
I skimmed through the book and it looks great! I'm about to buy it!

Besides this book are there any in the same league that are applicable to learn more about the diffusion and transformer model architectures?

1 comments

Jurafsky's 3rd edition draft of his NLP book and Simon Prince's DL book both have chapters on transformers, and the latter also on diffusion. Both have official free pdf versions.

https://web.stanford.edu/~jurafsky/slp3/

https://udlbook.github.io/udlbook/