Hacker News new | ask | show | jobs
by gschoeni 921 days ago
Have been studying the Mamba architecture all week and put together my notes here:

https://blog.oxen.ai/mamba-linear-time-sequence-modeling-wit...

I hadn't found a very satisfying explanation of the paper yet, and still had some questions at the end, but hopefully this can give people a good jumping off point for their understanding!