Hacker News new | ask | show | jobs
by koayon 843 days ago
Another interesting one is that the hardware isn't really optimised for Mamba yet either - ideally we'd want more of the fast SRAM so that we can store more larger hidden states efficiently