Hacker News new | ask | show | jobs
by TOMDM 734 days ago
Paper for the sparcified mixtral models

https://arxiv.org/abs/2406.05955