Hacker News new | ask | show | jobs
by estearum 83 days ago
> a departure from Mamba-2, which optimized for training speed.

?

1 comments

Yes? Mamba-2 optimized for training speed compared to Mamba-1. Mamba-3 adds optimization for inference. These are pretty much version numbers.