Hacker News new | ask | show | jobs
by cloudhan 919 days ago
Might be the training code related with the model https://github.com/mistralai/megablocks-public/tree/pstock/m...
1 comments

Mixtral-8x7B support --> Support new model

https://github.com/stanford-futuredata/megablocks/pull/45