https://news.ycombinator.com/item?id=48529544
Anyways SwiTransformer paper looks interesting and doing a post training to optimize for it looks interesting as well.