by gschoeni
921 days ago
I've been studying the Mamba architecture all week and put together my notes here: https://blog.oxen.ai/mamba-linear-time-sequence-modeling-wit... I hadn't found a very satisfying explanation of the paper yet, and I still had some questions at the end, but hopefully this gives people a good jumping-off point for their understanding!